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DNA SEQUENCES , RECOMBINANT DNA MOLECULES 
AND PROCESSES FOR PRODUCING SOLUBLE 

T4 PROTEINS 



5 TECHNICAL FIELD OF INVENTION 

This invention relates to DNA sequences, 
recombinant DNA molecules and processes for producing 
soluble T4 proteins. More particularly, this invention 
relates to DNA sequences that are characterized in 

10 that they code on expression in an appropriate uni- 
cellular host for soluble forms of T4, the receptor 
on the surface of T4* lymphocytes, or derivatives 
thereof. In accordance with this invention, the DNA 
sequences, recombinant DNA molecules and processes 

15 of this invention may be employed to produce soluble 

T4 essentially free of other proteins of human origin. 
This soluble protein may then advantageously be used 
in the immuno therapeutic, prophylactic, and diag- 
nostic compositions and methods of this invention. 

20 The soluble T4 protein-based immunothera- 

peutic compositions and methods of this" invention 
are useful in treating immuno def icient patients suf- 
fering from diseases caused by infective agents whose 
primary targets are T4 + lymphocytes. According to a 

25 preferred embodiment, this invention relates to solu- 
ble T4 protein-based compositions and methods which 
are useful in preventing, treating or detecting 
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acguired immune deficiency syndrome, AIDS related 
complex and HIV infection. 

BACKGROUND ART 

The class of immune regulatory cells known 
5 as T cell lymphocytes can be divided into two broad 
functional classes, the first class comprising T 
helper or inducer cells — which mediate T cell pro- 
liferation, lymphokine release and helper cell inter- 
actions for Ig release, and the second class compris- 

10 ing T cytotoxic or suppressor cells — which parti- 
cipate in T cell-mediated killing and immune response 
suppression. In general, these two classes of 
lymphocytes are distinguished by expression of one 
of two surface glycoproteins: T4 (m.w. 55,000-62,000 

15 daltons) which is expressed on T helper or inducer 
cells, probably as a monomer ic protein, or T8 (m.w. 
32,000 daltons ) which is expressed on T cytotoxic or 
suppressor cells as a dimeric protein. 

The primary structures of T4 and T8 have 

20 been deduced from their respective cDNA sequences 

[P. J. Maddon et al. , "The Isolation and Nucleotide 
Sequence Of A cDNA Encoding The T Cell Surface Protein 
T4: A New Member Of The Immunoglobulin Gene Family", 
Cell , 42, pp. 93-104 (1985); D. R. Littman et al., 

25 "The Isolation And Sequence Of The Gene Encoding T8: 
A Molecule Defining Functional Classes Of T Lympho- 
cytes", Cell , 40, pp. 237-46 (1985)]. Both predicted 
protein sequences define molecules with domains 
expected for surface antigens -,- including transmem- - 

30 brane and intracytoplasmic domains at the carboxyl 

"end of the protein. In addition, both proteins con- 
tain an amino terminal region which shows striking 
homology to immunoglobulin and T cell receptor 
variable regions and which might function during 

35 target cell recognition [ Maddon et al ■ , supra ] . 
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In immunocompetent individuals, T4 lympho- 
cytes interact with other specialized cell types of 
the immune system to confer immunity to or defense 
against infection [E. L. Reinherz and S. F. 
5 Schlossmann "The Differentiation Function Of Human 
T-Cells", Cell , 19, pp. 821-27 (1980) J. More speci- 
fically, T4 lymphocytes stimulate production of growth 
factors which are critical to a functional immune 
system. For example, they act to stimulate B cells, 

10 the descendants of hemopoietic stem cells, which 
promote the production of defensive antibodies. 
They also activate macrophages ("killer cells") to 
attack infected or otherwise abnormal host cells and 
they induce monocytes ("scavenger cells") to encompass 

15 and destroy invading microbes. 

It has been found that the primary target 
of or receptor for certain infective agents is the 
T4 surface protein. These agents include, for 
example, viruses and retroviruses. When T4 lympho- 

20 cytes are exposed to such agents, they are rendered 

nonfunctional. As a result, the host's complex immune 
defense system is destroyed and the host becomes 
susceptible to a wide range of opportunistic infec- 
tions. 

25 Such immunosuppression is seen in patients 

suffering from acquired immune deficiency syndrome 
("AIDS")". AIDS is a disease characterized by severe 
or, typically, complete immunosuppression and 
attendant host susceptibility to a wide range of 

30 opportunistic infections and malignancies . In some 

cases, AIDS infection is accompanied by central 
nervous system disorders. Complete clinical mani- 
festation of AIDS is usually preceded by AIDS 
related complex ( "ARC" ) , a syndrome accompanied by 

35 symptoms such as persistent generalized lymphadeno- 
pathy, fever and weight loss. The human immunode- 
ficiency virus ("HIV") retrovirus is thought to be 
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the etiological agent responsible for AIDS infection 
and its precursor, ARC [M. G. S .rngadharan et al . , 
"Detection, Isolation And Continuous Production Of 
Cytopathic Retroviruses (HTLV-III) From Patients 
5 With AIDS And Pre-AIDS", Science , 224, pp. 497-508 
(1984)] .* 

Between 85 and 100% of the AIDS/ARCS popu- 
lation test seropositive for HIV [G. N. Shaw et al., 
"Molecular Characterization Of Human T-Cell Leukemia 

10 ( Lympho tropic ) Virus Type III In The Acquired Immune 
Deficiency Syndrome 11 , Science , 226, pp. 1165-70 
(1984)]. The number of adults in the United States 
infected with HIV has been estimated to be between 1 
and 2.5 million [D. Barnes, "Strategies For An AIDS 

15 Vaccine", Science , 233, pp. 1149-53 (1986); M. Rees, 
"The Sombre View of AIDS", Nature , 326, pp. 343-45 
(1987)]. These estimates include 64,900 individuals 
who do not belong to an identified group at risk for 
AIDS [S. L. Sivak and G. P. Worms er, "How Common Is 

20 HTLV-III Infection In The United States?", New Eng. 
J. Med. , 313, p. 1352 (1985)]. The apparent annual 
rate of diagnosis for those infected with HIV virus 
is between 1 and 2% — a rate which may increase 
significantly in future years. 

25 The genome of retroviruses, such as HIV, 

contains three regions encoding structural proteins. 
The gag region encodes the core proteins of the 
virion. The pol region encodes the virion RNA-depen- 
dent DNA polymerase (reverse transcriptase). The 



30 



. * In this application, human immunodeficiency 
virus ( "HIV" ) , the generic term adopted by the human 
retrovirus subcommittee of the International Committee 
On Taxonomy Of viruses to refer to independent iso- 
35 lates from AIDS patients, including human T cell 
lymphotropic virus type III ( "HTLV-I I I" ) , lympha- 
denopathy-associated virus ("LAV"), human immuno- 
deficiency virus type 1 ("HIV-l") and AIDS-associated 
retrovirus ( "ARV" ) will be used. 
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env region encodes the major glycoprotein f und in 
the membran envel pe of the vijr-s and in the cyto- 
plasmic membrane of infected cells. The capacity f 
the virus to attach to target cell receptors and to 
S cause fusion of cell membranes are two HIV proper- 
ties controlled by the env gene. These properties 
are believed to play a fundamental role in the patho- 
genesis of the virus. 

HIV env proteins arise from a precursor 

10 polypeptide that, in mature form, is cleaved into a 
large heavily glycosylated exterior membrane protein 
of about 481 amino acids — gp!20 ~ and a smaller 
transmembrane protein of about 345 amino acids which 
may be glycosylated — gp41 [L». Ratner et al. # 

15 "Complete Nucleotide Sequence of The AIDS Virus, 
HTLV-III U , Nature, 313. pp. 277-84 (1985)]. 

The host range of the HIV virus is asso- 
ciated with cells which bear the surface glycopro- 
tein T4. Such cells include T4 lymphocytes and 

20 brain cells [P. J . Maddon et al., "The T4 Gene Encodes 
The AIDS Virus Receptor And Is Expressed In The 
Immune System And The Brain", Cell , 47, pp. 333-48 
(1986)]. Upon infection of a host by HIV virus , the 
T4 lymphocytes are rendered non- functional. The 

25 progression of AIDS/ARCS syndromes can be correlated 
with the depletion of T4 lymphocytes, which display 
the T4 surface glycoprotein. This T cell depletion, 
with ensiling immunological compromise, may be attri- 
butable to both recurrent cycles of infection and 

30 lytic growth from cell-mediated spread of the virus. 
In addition, clinical observations suggest that the 
HIV virus is directly responsible for the central 
nervous system disorders seen in many AIDS patients. 

The tropism of the HIV virus for T4* cells 

35 is believed to be attributed to the role of the T4 
cell surface glycoprotein as the membrane-anchored 
virus receptor. Because T4 behaves as the HIV virus 
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receptor, its extracellular sequence probably plays 
a direct role in binding HIV. A r specifically, it 
is believed that HIV envelope selectively binds to 
the T4 epitope(s), using this interaction to initiate 
5 entry into the host cell [A. G. Oalgelish et al., 
"The CD4 (T4) Antigen Is An Essential Component Of 
The Receptor For The AIDS Retrovirus", Nature , 312, 
pp. 763f-67 (1984); D. Klatzmann et al. , "T- Lymphocyte 
T4 Molecule Behaves As The Receptor For Human Retro- 

10 virus LAV" , Nature , 312, pp. 767-68 (1984)]. Accord- 
ingly, cellular expression of T4 is believed to be 
sufficient for HIV binding, with the T4 protein 
serving as a receptor for the HIV virus. 

The T4 tropism of the HIV virus has been 

15 demonstrated in vitro . When HIV virus isolated from 
AIDS patients is cultured together with T helper 
lymphocytes preselected for surface T4, the lympho- 
cytes are efficiently infected, display cytopathic 
effects, including multinuclear syncytia formation 

20 and are killed by lytic growth [D. Klatzmann et al . , 
"Selective Tropism Of Lymphadenopathy Associated 
Virus (LAV) For Helper- Inducer T Lymphocytes 1 *, 
Science , 225 . pp, 59-63 (1984); F. Wong-Staal and 
R. C. Gallo, "Human T-Lympho tropic Retroviruses", 

25 Nature, 317, pp. 395-403 (1985)]. It has been demon- 
strated that a cloned cDNA version of human T4, when 
expressed, on the surface of trans fected cells from 
non-T cell lineages, including murine and fib rob 1 as - 
toid cells, endows those cells with the ability to 

30 bind HIV [P. J. Maddon et al., "The T4 Gene Encodes 

The AIDS Virus Receptor And Is Expressed In The Immune 
System And The Brain", Cell, 47, pp. 333-48 (1986)]. 

During the course of HIV infection, the 
host mounts both a humoral and a cellular immune 

35 response to the virus. These responses include the 
appearance of antibodies which bind to a number of 
viral products and which exhibit neutralizing effect 
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or antibody dependent cellular cytotoxic functions 
[M. Guroff-Robert et al . , "HTL\ -I I I -Neutralizing 
Antib dies In Patients With AIDS And AIDS-Related 
Complex", Nature , 316 r pp. 72-74 (1985); D. D. F. 
5 Barin et al., "Virus Envelope Protein Of HTLV-m 
Represents Major Target Antigen For Antibodies In 
AIDS Patients", Science , 228, pp. 1094-96 (1985); 
A. H. Rook et al., "Sera From HTLV-IH/lav Antibody 
Positive Individuals Mediate Antibody Dependent 
10 Cellular Cytotoxicity Against ETLV-m/LAV Infected 
T Cells", J. Immunol. , 138, pp. 1064-68 (1987)]. 
Epitopes of the HIV envelope have been identified as 
important determinants in eliciting a neutralizing 
antibody response. And, determinants in antibody 
15 dependent cellular cytotoxicity ("ADCC") activity 
include HIV env and, possibly, gag epitopes. 

In the absence to date of effective treat- 
ments for AIDS, many efforts have centered on preven- 
tion of the disease. Such preventative measures 
20 include HIV antibody screening for all blood, organ 
and semen donors and education of AIDS high-risk 
groups regarding transmission of the disease. 

Experimental or early-stage clinical treat- 
ment of AIDS and ARCS conditions have included the- 
25 administration of antiviral drugs, such as HPA-23, 

phosphonof ornate, suramin, ribavirin, azido thymidine 
( "AZT" ) and dideoxycytidine , which apparently inter- 
fere with replication of the virus through reverse 
transcriptase inhibition. Although each of these 
30 drugs exhibits activity against HIV in vitro , only 
AZT has demonstrated potential benefits in clinical 
trials. AZT administration in effective amounts, 
however, has been accompanied by undesirable and 
debilitating side effects, such as bone marrow 
35 depression. It is likely, therefore, that hemato- 
logic toxicity will be a major rate limiting factor 
in the long term use of AZT. 
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Other proposed methods for treating AIDS 
have focused on the development of agents having 
activity against steps in the viral replicative cycle 
other than reverse transcription. Such methods 
5 include the administration of interferons or the 

application of hybridoma technology. Most of these 
treatment strategies are expected to require the 
co- administration of immunomodulators, such as inter- 
leukin-2 . 

10 To date, the need exists for the develop- 

ment of effective immuno therapeutic agents and methods 
for the treatment of AIDS, ARCS, HIV infection and 
other immunodeficiencies caused by T lymphocyte 
depletion or abnormalities. 

15 DISCLOSURE OF THE INVENTION 

The present invention solves the problems 
referred to above by providing, in large amounts, 
soluble T4 and soluble derivatives thereof that act 
as receptors for infective agents whose primary target 

20 is the T4 surface protein of T4 + lymphocytes. Advan- 
tageously, this invention also provides soluble T4 
essentially free of other proteins of human origin 
and in a form that is not contaminated by viruses, 
such as HIV or hepatitis B virus . 

25 As will be appreciated from the disclosure 

to follow, the DNA sequences and recombinant DNA 
molecules of this invention are capable of directing, 
in an appropriate host, the production of soluble T4 
or derivatives thereof. The polypeptides of this 

30 invention are useful, either as produced in the host 
or after further derivatization or modification, in 
a variety of immunotherapeutic compositions and 
methods for treating immunodeficient patients 
suffering from diseases caused by infective agents 

35 whose primary targets are T4 + lymphocytes. According 
to various embodiments of this invention, such compo- 
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sitions and methods relate to a soluble receptor for 
HIV, soluble T4 proteins and pc ^ypeptides and anti- 
bodies thereto. The soluble T4 proteins and polypep- 
tides of this invention include monovalent, as well 
5 as polyvalent forms . 

The compositions and methods of this inven- 
tion, which are based upon soluble T4 proteins, poly- 
peptides or peptides and antibodies thereto, axe 
particularly useful for the prevention, treatment or 

10 detection of the HIV-related infections AIDS and 

ARC. More specifically, the soluble T4-based com- 
positions and methods of this invention employ 
soluble T4-li)ce polypeptides — polypeptides which 
advantageously interfere with the T4/HIV interaction 

15 by blocking or competitive binding mechanisms which 
inhibit HIV infection of cells expressing the T4 
surface protein- These soluble T4-like polypeptides 
inhibit adhesion between T4 lymphocytes and infec- 
tive agents which target T4* lymphocytes and inhibit 

20 interaction between T4* lymphocytes and antigen pre- 
senting cells and targets of T4* lymphocytes mediated 
killing. By acting as soluble virus receptors, the 
compositions of this invention may be used as anti- 
viral therapeutics to inhibit HIV binding to T4* 

25 cells and viral ly induced syncytium formation at the 
level of receptor binding. 

This invention accomplishes these goals by 
providing DNA sequences coding on expression in an 
appropriate unicellular host for soluble T4 proteins* 

30 and soluble derivatives thereof 



* As used in this application, "soluble T4 pro- 
tein", "soluble T4" and "soluble T4-like polypeptides" 
include all proteins, polypeptides and peptides which 
35 are natural or recombinant soluble T4 proteins, or 
soluble derivatives thereof, and which are charac- 
terized by the immuno therapeutic ( anti-retroviral ) 

(footnote continued on following page) 
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This invention also provides recombinant 
DNA molecules containing th se JNA sequences and 
unicellular hosts transformed with them. Those hosts 
permit the production of large quantities of the 
novel soluble T4 proteins, polypeptides, peptides 
and derivatives of this invention for use in a wide 
variety of therapeutic, prophylactic and diagnostic 
compositions and methods 

The DNA sequences of this invention are 
selected from the group consisting cf : 

(a) the DNA inserts of pl99-7, pBC377, 
PBG380, pBG381, p203-5, pBG391, pBG392, pBC393, 
PBG394, PBG39S. pBG396, pBG397, p211-ll. p214-10 
and p215-7; 

15 < b > DNA sequences which hybridize to one 

or more of the foregoing DNA inserts and which code 
on expression for a soluble T4-like polypeptide; and 

(c) DNA sequences which code on expression 
for a soluble T4-liJce polypeptide coded for on 

20 expression by any of the foregoing DNA inserts and 
sequences . 

According to an alternate embodiment, this 
invention also relates to a DNA sequence comprising 
the DNA insert of pl70-2, said sequence coding on 
25 expression for a T4-like polypeptide. And. this 

invention also relates to recombinant DNA molecules 
and processes for producing T4 protein using that 
DNA sequence. 



30 (footnote continued from preceding page) 

or immunogenic activity of soluble T4 protein. They 
include soluble T4-like compounds from a variety of 
sources, such as soluble T4 protein derived from 
natural sources, recombinant soluble T4 protein and 
synthetic or semi-synthetic soluble T4 protein 



35 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is an autoradi graph depicting 
the purificati n of T4 protein from U937 cells by 
immunoaf finity chromatography. 
5 Figure 2 depicts autoradiograph and Western 

blot data demonstrating that immunoaf finity-purif ied, 
solubilized native T4 protein binds to HIV envelope 
protein. 

Figure 3 depicts the nucleotide sequence 
10 and the derived amino acid sequence of T4 cDNA 

obtained from PBL clone X203-4. In this figure, the 
amino acids are represented by single letter codes 
as follows : 

Phe: F Leu: L He: I Met: M 

15 Val: V Ser: S Pro: P Thr: T 

Ala: A Tyr: Y His: H Gin: Q 

Asn: N Lys: K Asp: D Glu: E 

Cys: C Trp: W Arg: R Gly: G 



= position at which a stop codon is 



20 present. 



In Figure 3, the T4 protein translation 
start (AA_23) " located at the methionine at nucleo- 
ides 201-203 and the mature N- terminus is located at 
the lysine (AA 3 ) at nucleotides 276-278. 

25 Figure 4 is a schematic outline of the 

construction of cDNA clones pBG312.T4 (also called 
pl71-l) and p!70-2. 

Figure 5 is a schematic outline of the 
construction of plasmid pEClOO. 

30 Figure 6 depicts amino acid comparisons at 

a positions 3* 64 and 231 of various T4 cDNA clones. 

Figures 7A and 7B depict the protein domain 
structure of purified, solubilized T4 protein and 
recombinant soluble T4 mutants. 

35 Figures 8A-8D are schematic outlines of 

constructions of various intermediate plasmids and 
other plasmids used to express recombinant soluble 
T4 ("rsT4" ) of this invention. 
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Figure 9A is a schematic outline of the 
construction of plasmid pl99-7. 

Figures 9B and 9C are schematic outlines 
of the construction of plasxaid p203-5. 
5 Figure 10 depicts the synthetic oligo- 

nucleotide linkers employed in various constructions 
according to this invention. 

Figure 11 depicts the nucleotide sequence 
of the entire plasmid defined by p!99-7 (P L mutet.rsT4 ) 
10 and its rsT4.2 insert and the amino acid sequence 
deduced from the rsT4 sequence. This Includes the 
Clal-Clal cassette which defines the Met perfect 
rsT4.2 coding sequence. 

Figure 12 depicts a protein blot analysis 
15 of an induction of rsT4.2 expression from 
SG936/pl99-7. 

Figure 13 is a schematic outline of the 
construction of plasmid pBG368. 

Figures 14A-14C are schematic outlines of 
20 constructions of various plasmids of this invention. 

Figure 15 depicts the nucleotide sequence 
of plasmid pBG391. 

Figure 16 depicts the nucleotide sequence 
of plasmid pBG392. In this figure, the T4 protein 
25 translation start (AA_ 23 ) is located at the methio- 
nine at nucleotides 1207-1209 and the mature 
N- terminus is located at the lysine (AA 3 ) at nucleo- 
tide 1281-84. 

Figure 17 is a schematic outline of con- 
30 structions of various plasmids of this invention. 

Figure 18 depicts the synthetic oligonucleo- 
tide linkers employed in various constructions 
according to this invention. 

Figure 19 depicts the nucleotide sequence 
35 of plasmid pBG394. 

Figure 20 depicts the nucleotide sequence 
of plasmid pBG396. 
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Figure 21 depicts the nucleotide sequence 
of plasmid pBG393. 

Figure 22 depicts the nucleotide sequence 
of plasmid pBG395. 

Figure 23 is a Coomassie stained gel of 
rsT4.2 purified from the conditioned medium of the 
pBG380 transfected CHO cell line BG380G of plasmid 
pl96-10. 

Figure 24 is a schematic outline of the 
construction of plasmid pl96-10. 

Figure 25 is a schematic outline of the 
construction of plasmid pBG394 . 

Figure 26 is a schematic outline of the 
construction of plasmid p211-ll. 

Figure 27 is a schematic outline of the 
construction of plasmid p215-7. 

Figure 28 is a schematic outline of the 
construction of plasmid p218-8. 

Figure 29A is a Coomassie stained gel of 
rsT4. 113.1 purified from the conditioned medium of 
PBG211-11 transfected E.coli . 

Figure 29B is an autoradio graph depicting 
a Western blot analysis of rsT4. 113.1 expressed in 
E.coli . 

Figure 30, panels (a)-(c) depict the puri- 
fication of rsT4. 113.1 from E.coli trans formants . 

Figure 31, panels (a)-(c) depict the 
refolding of purified rsT4. 113.1. 

Figure 32 is an autoradiograph depicting 
the immunoprecipitation of 35 S-metabolically labelled 

— GHG -cell —lines- producing recombinant- soluble- T4~ 

Figure 33 depicts an immunoblot analysis 
of COS 7 cell lines producing recombinant soluble T4. 

Figure 34 depicts in graphic form the 
results of a competition assay between rsT4. 113.1 
and rsT4.3 for binding to OKT4A or OKT4. 
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Figures 35-37 depict in graphic form the 
results f competition assays ..etween rsT4.111 and 
rsT4.3 for binding to, respectively, OKT4A, Leu-3A 
and OKT4. 

5 Figure 38 depicts in graphic form an ELISA 

assay for rsT4. 113.1 from E.coli trans form an ts . 

Figure 39 depicts in graphic form the 
results of a p24 radioimmunoassay using recombinant 
soluble T4 according to this invention, 
10 Figures 40 and 41 depict the results of 

syncytia inhib ition assays using recombinant soluble 
T4 proteins according to this invention. 

Figure 42 is a schematic outline of the 
construction of plasmid pBiv.l. 
15 Figure 43 depicts the bivalent recombinant 

soluble T4 protein produced by pBiv.l. 

DETAILED DESCRIPTION OF THE INVENTION 

We isolated the DNA sequences of this 
invention from two libraries: a Xgt cDNA library 

20 derived the T cell tumor line REX and a XgtlO cDNA 
library derived from peripheral blood lymphocytes. 
However, we could also have employed libraries pre- 
pared from other cells that express T4. These 
include, for example, H9 and U937. We also used a 

25 human genomic bank to isolate various fragments of 
the T4 gene. 

For screening these libraries, we used a 
series of chemically synthesized anti-sense oligo- 

nucleotide DNA probes based upon the T4 protein 

30 sequence set forth in Maddon et al. (1985), supra. 

For screening, we hybridized our oligo- 
nucleotide probes to our cDNA libraries utilizing a 
plaque hybridization screening assay. We selected 
clones hybridizing to several of our probes. And, 
35 after isolating and subcloning the cDNA inserts of 
the selected clones into plasraids, we determined 



3664 



89085519 



-15- 



their nucleotide sequences and compared th amino 
acid sequences dedv :ed from those nucleotide sequences 
to the amino acid sequences referred to in Maddon 
et al. (1985), supra. As a result of these compari- 
5 sons, we determined that all of our selected clones 
were characterized by cONA inserts coding for amino 
acid sequences of human T4. 

We have depicted in Figure 3 the nucleo- 
tide sequence of full-length T4 cONA obtained from 

10 deposited clone p!70-2 and the amino acid sequence 
deduced therefrom. That cDKA sequence was subse- 
quently subjected to in vitro site-directed mutagen- 
esis and restriction fragment substitution so that 
its cDNA sequence was identical to that of Maddon 

15 et al. 

After modifying our T4 cDNA sequence to be 
identical to that of Maddon et al . , we truncated 
samples of it in various positions to remove the 
coding regions for the transmembrane and intracyto- 

20 plasmic domains. The remaining cONA sequences encoded 
a soluble T4 w hich retained the extracellular region 
believed to be responsible for HIV binding. 

We then constructed various clones charac- 
terized by such cDKA inserts coding for human soluble 

25 T4. Those cDNA sequences may be used in a variety 
of ways in accordance with this invention. More 
particularly, those sequences or portions of them, 
or synthetic or semi -synthetic copies of them, may 
be used as DNA probes to screen other human or animal 

30 cDNA or genomic libraries to select by hybridization 
other DNA sequences that are related to soluble T4. 
Typically, conventional hybridization conditions, 
"e.g., about 20° to 27°C below Tm, are employed in 
such selections. However, less stringent conditions 

35 may be necessary when the library is being screened 
with a probe from a different species than that from 
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which the library is derived, e.g., the screening of 
a mouse library with a human pr oe . 

Such cDNA inserts, portions of them, or 
synthetic or semi -synthetic copies of them, may also 
5 be used as starting materials to prepare various 

mutations. Such mutations may be either degenerate, 
i.e., the mutation does not change the amino acid 
sequence encoded by the mutated codon, or non- 
degenerate, i.e., the mutation changes the amine 

10 acid sequence encoded by the mutated codon. Both 

types of mutations may be advantageous in producing 
or using soluble T4*s according to this invention. 
For example, these mutations may permit higher levels 
of production or easier purification of soluble T4 

15 or higher T4 activity. 

For all of these reasons, the DNA sequences 
of this invention are selected from the group con- 
sisting of: 

(a) the DNA inserts of p!99-7, pBG377, 

20 pBG380, pBG381, p203-5, pBG391, pBG392, pBG393, pBG394, 
pBC395, pBG396, pCG397, p211-ll, p214-10 and p215-7; 

(b) DNA sequences which hybridize to one 
or more of the foregoing DNA inserts and which code 
on expression for a soluble T4-like polypeptide; and 

25 (c) DNA sequences which code on expres- 

sion for a soluble T4-like polypeptide coded for on 
expression by any of the foregoing DNA inserts and 
sequences. 

Preferably, the DNA sequences of this 

_3_0 invention, code__f or_a polypeptide- selected from- the 

group consisting of a polypeptide of the formula 
AA_ 23 -AA 362 of Figure 3, a polypeptide of the formula 
AA 1 _ 362 of Figure 3, a polypeptide of the formula 
Met-AA 1-362 of Figure 3, a polypeptide of the formula 
35 AA 1-3?4 of Figure 3, a polypeptide of the formula 

Met-AA 1 _ 3?4 of Figure 3, a polypeptide of the formula 
AA, „ 7 of Figure 3, a polypeptide of the formula 
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Met-AA 1-377 of Figure 3, a polypeptide of the formula 
AA_ 23 -AA 3?4 of Figure 3, a polyp ptide of the formula 
AA_ 23 -AA 377 of Figure 3, or portions thereof. 

DNA sequences according to this invention 
5 also preferably code for a polypeptide selected from 
the group consisting of a polypeptide of the formula 
AA_ 23 -AA 1Q2 of Figure 16, a polypeptide of the 
formula AA i" AA ia2 °^ Figure 16, a polypeptide of 
the formula Met-AA^_^ Q2 of Figure 16, a polypeptide 
10 of the formula AA_ 23 -AA 1Q2 of Figure 16, followed by 

the amino acids asparagine-leucine-glutamine-histidine- 
serine-leucine, a polypeptide of the formula 
AA^-AA 182 of Figure 16, followed by the amino acids 
asparagine-leucine-glutamine-histidine-serine-leucine, 
15 a polypeptide of the formula Met -AA 1 . 182 of Figure 16, 
followed by the amino acids asparagine-leucine- 
glutamine-histidine-serine-leucine, a polypeptide of 
the formula AA_ 23 -AA 113 of Figure 16, a polypeptide 
of the formula AA 1 -AA 113 of Figure 16, a polypeptide 
of the formula ^t-AAj^^ of Figure 16, a polypeptide 
of the formula AA^S'^lll of Figure 16, a polypeptide 
of the formula AA^-AA^^ of Figure 16, a polypeptide 
of the formula Met-AA 1 _ 111 of Figure 16, a polypep- 
tide of the formula ^^23^^131 of Figure 16, a poly- 
25 peptide of the formula AA 1 -AA 131 of Figure '16, a 

polypeptide of the formula Met " A A 1 . 131 of Figure 16, 
a polypeptide of the formula AA_ 23 «AA 145 of Figure 16, 
a polypeptide of the formula AA 1 ~AA 145 of Figure 16, 
a polypeptide of the formula Met "AA 1-145 of Figure 16, 
_30 a L polypeptide of formula AA -23 -AA 166 of _Figure 16^, 

a polypeptide of the formula AA 1 -AA 166 of Figure 16, 
3 polypeptide of the formula Met-AA 1-166 of Figure 16, 
or portions thereof. 

Additionally, DNA sequences of this inven- 
35 tion code for a polypeptide selected from the group 

consisting of a polypeptide of the formula AA -23 -AA 362 

of mature T4 protein, a polypeptide of the formula 
3667 
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AA 1 362 of mature T4 protein, a polypeptide of the 
formula Met-AA 1-362 of mature protein, a polypep- 
tide of the formula AA 1 _37 4 of mature T4 protein, a 
polypeptide of the formula Met-AA 1-3?4 of mature T4 
5 protein, a polypeptide of the formula AA 1-377 of 
mature T4 protein, a polypeptide of the formula 
Met-AA 1-3?7 of mature T4 protein, a polypeptide of 
the formula AA_ 23 -AA 374 of mature T4 protein, a poly- 
peptide of the formula AA_ 23 -AA 377 of mature T4 pro- 

10 tein, or portions thereof. 

DNA sequences according to this invention 
also code for a polypeptide selected from the group 
consisting of a polypeptide of the formula AA _23~ AA 182 
of mature T4 protein, a polypeptide of the formula 

15 AA i" AA i82 o£ Bature T4 Protein, a polypeptide of the 
formula Wet - AA i_i82 of mature T4 Protein, a polypep- 
tide of the formula AA _23~ AA 182 of mature T4 Protein, 
followed by the amino acids aspaxagine- leucine- 
glutamine-histidine-serine-leucine, a polypeptide of 

20 the formula AA i~ AA i82 of mature T4 protein, followed 
by the amino acids asparagine-leucine-glutamine- 
his tidine-serine-leucine , a polypeptide of the 
formula Met-AA 1-182 of mature T4 protein, followed 
by the amino acids asparagine-leucine-glutamine- 

25 histidine-serine-leucine, a polypeptide of the 

formula AA _23~ AA 113 of mature T4 protein, a polypep- 
tide of the formula AA^-AA^^ of mature T4 protein, 
a polypeptide of the formula Met-AA 1-113 of mature 
T4 protein, a polypeptide of the formula AA _23~ AA 111 

30 "of mature T4 protein,- a polypeptide of the formula- 
- AA^-AA 111 of mature T4 protein, a polypeptide of the 
* formula Met-AA 1-ni of mature T4 protein, a polypep- 
tide of the formula AA_ 23 «AA 131 of mature T4 protein, 
a polypeptide of the formula AA 1 «AA 131 of mature T4 

35 protein, a polypeptide of the formula Met-AA 1-131 of 
mature T4 protein, a polypeptide of the formula 
AA 23 -AA 145 of mature T4 Protein, a polypeptide of 
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the formula AA^-AA^^ of mature ^4 protein, a polypep- 
tide of the formula Met-AA 1-145 of nature T4 protein, 
a polypeptide of the formula ^.23*^166 °^ mature 
T4 protein, a polypeptide of the formula ^^"^166 
5 of mature T4 protein, a polypeptide of the formula 

Met-AA 1-166 of mature T4 protein, or portions thereof. 

The amino terminal amino acid of mature 
T4 protein isolated from T cells begins at lysine, 
the third amino acid of the sequence depicted in 

10 Figure 16. Accordingly, soluble T4 proteins also 
include polypeptides of the formula AA 3 ~AA 377 of 
Figure 16, or portions thereof. Such polypeptides 
include polypeptides selected from the group consist- 
ing of a polypeptide of the formula AA-j to AA 362 of 

15 Figure 16, a polypeptide of the formula AA 3 to AA^j^ 
of Figure 16, a polypeptide of the formula AA 3 -AA 182 
of Figure 16, a polypeptide of the formula AA 3" AA 11 3 
of Figure 16, a polypeptide of the formula AA-^-AA^^ 
of Figure 16, a polypeptide of the formula AA 3 -AA 145 

20 of Figure 16, a polypeptide of the formula AA 3 ~AA 166 
of Figure 16, and a polypeptide of the formula 
AA 3 -AA 111 of Figure 16. Soluble T4 proteins also 
include the above-recited polypeptides preceded by 
an N- terminal methionine group. 

25 Soluble T4 protein constructs according to 

this invention may also be produced by truncating 
the full length T4 protein sequence at various posi- 
tions to remove the coding regions for the transmem- 
brane and intracytoplasmic domains, while retaining 

30 the extracellular region believed to be responsible 
for HIV binding: More particularly, soluble T4~ 
" polypeptides may be produced by conventional tech- 
niques of oligonucleotide directed mutagenesis; 
restriction digestion, followed by insertion of 

35 linkers; or chewing back full length T4 protein with 
enzymes . 
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Alternatively, soluble T4 polypeptides may 
be chemically synthesized by conventional peptide 
synthesis techniques, such as solid phase synthesis 
[R. B. Merrifield, "Solid Phase Peptide Synthesis. 
5 I. The Synthesis Of A Tetrapeptide" , J . Am. Chem . 
Soc. , 83, pp, 2149-54 (1963)]. 

The DNA sequences of this invention code 
for soluble proteins and derivatives that are believed 
to bind to Major Histocompatibility Complex antigens 
10 and envelope glycoprotein of certain retroviruses, 

such as HIV. Preferably, they also inhibit syncytium 
formation, believed to be the mode of intracellular 
HIV virus spread. And, they may inhibit interaction 
between T4 lymphocytes and antigen-presenting cells 
15 and targets of T4 + cell mediated killing. Most 

preferably, they also inhibit adhesion between T4* 
lymphocytes and infective agents, such as the HIV 
virus, whose primary targets are T4* lymphocytes . 

The DNA sequences of this invention are 
20 also useful for producing soluble T4 or its deriva- 
tives coded for on expression by them in unicellular 
hosts transformed with those DNA sequences. As well 
known in the art, for expression of the DNA sequences 
of this invention, the DNA sequence should be opera- 
25 tively linked to an expression control sequence in 
an appropriate expression vector and employed in 
that expression vector to transform an appropriate 
unicellular host. 

Such operative linking of a DNA sequence 
30 of tliis invention to an expression control sequence, 
of course, includes the provision of a translation 
start signal in the correct reading frame upstream 
of the DNA sequence. If the particular DNA sequence 
of this invention being expressed does not begin 
35 with a methionine, the start signal will result in 
an additional amino acid — methionine — being 
located at the N— terminus of the product. While 
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such me thionyl -containing product may be employed 
directly in the compositi ns and methods f this 
invention, it is usually more desirable to remove 
the methionine before use. Methods are available in 
5 the art to remove such N- terminal methionines from 
polypeptides expressed with them. For example, 
certain hosts and fermentation conditions permit 
removal of substantially all of the N- terminal 
methionine in vivo . Other hosts require in vitro 
10 removal of the N- terminal methionine. However, such 
in vivo and in vitro methods are well known in the 
art. 

A wide variety of host/expression vector 
combinations may be employed in expressing the DNA 

15 sequences of this invention. Useful expression 
vectors, for example, may consist of segments of 
chromosomal, non-chromosomal and synthetic OKA 
sequences, such as various known derivatives of SV40 
and known bacterial pi asmids, e.g., plasmids from 

20 E.coli including col El, pCRl, pBR322, pMB9 and their 
derivatives, wider host range plasmids, e.g., RP4, 
phage DHAs, e.g., the numerous derivatives of phage X, 
e.g., NM989, and other DNA phages, e.g., K13 and 
filamenteous single stranded DNA phages, yeast plas- 

25 mids, such as the 2p plasmid or derivatives thereof, 
and vectors derived from combinations of plasmids 
and phage DNAs, such as plasmids which have been 
modified to employ phage DNA or other expression 
control sequences. For animal cell expression, we 

30 prefer to use plasmid pBG368, a derivative of pBG3 12 
[R. Cate et al., "Isolation Of The Bovine And Human 
Genes For Mullerian Inhibiting Substance And 
Expression Of The Human Gene In Animal Cells", Cell , 
45, pp. 685-98 (1986)] which contains the major late 

35 promoter of adenovirals 2. 

In addition, any of a wide variety of 
expression control sequences — sequences that con- 
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trol the expression of a DNA sequence when opera- 
tively linked to it — may be us in these vectors 
to express the DNA sequence of this invention. Such 
useful expression control sequences , include, for 
example , the early and late promoters of SV40 or the 
adenovirus, the lac system, the trp system, the TAG 
or TRC system, the major operator and promoter regions 
of phage A, the control regions of fd coat protein, 
the promoter for 3-phosphoglycerate kinase or other 
glycolytic enzymes, the promoters of acid phosphatase, 
e.g., PhoS, the promoters of the yeast a -mating 
factors, the polyhedron promoter of the baculovirus 
system and other sequences known to control the 
expression of genes of prokaryotic or eukaryotic 
cells or their viruses, and various combinations 
thereof. For animal cell expression, we prefer to 
use an expression control sequence derived from the 
major late promoter of adenovirus 2. 

A wide variety of unicellular host cells 
are also useful in expressing the DNA sequences of 
this invention. These hosts may include well known 
eukaryotic and prokaryotic hosts, such as strains of 
E.coli , Pseudomonas m Bacillus , Streptomyces , fungi, 
such as yeasts, and animal cells, such as CEO and 
mouse cells, African green monkey cells, such as 
COS 1, COS 7, BSC 1, BSC 40, and BMT 10, insect cells, 
and human cells and plant cells in tissue culture. 
For animal cell expression, we prefer CHO cells and 
COS 7 cells. 

It should of course be understood that- not 
all vectors and expression control sequences will 
function equally well to express the DNA sequences 
of this invention. Neither will all hosts function 
equally well with the same expression system. How- 
ever, one of skill in the art may make a selection 
among these vectors, expression control sequences, 
and hosts without undue experimentation and without 
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departing from the scope of "this invention. For 
example, in selecting a vector, the h st must be 
considered because the vector must replicate in it. 
The vector's copy number, the ability to control 
5 that copy number, and the expression of any other 
proteins encoded by the vector, such as antibiotic 
markers, should also be considered. 

In selecting an expression control sequence, 
a variety of factors should also be considered. 

10 These include, for example, the relative strength of 

the system, its controllability, and its compatibility 
with the particular DNA sequence of this invention, 
particularly as regards potential secondary struc- 
tures. Unicellular hosts should be selected by 

15 consideration of their compatibility with the chosen 
vector, the toxicity of the product coded for on 
expression by the DNA sequences of this invention to 
them, their secretion characteristics, their ability 
to fold proteins correctly, their fermentation re- 

20 quirements, and the ease of purification of the 

products coded on expression by the DNA sequences of 
this invention. 

Wit hin these parameters, one of skill in 
the art may select various vector/expression control 

25 system/host combinations that will express the DNA 

sequences of this invention on fermentation or in large 
scale animal culture, e.g., CHO cells or COS 7 cells. 

The polypeptides produced on expression of 
the DNA sequences of this invention may be isolated 

30 from the fermentation or animal cell cultures and 
purified using any of a variety of conventional 
methods. One of skill in the art may select the 
most appropriate isolation and purification tech- 
niques without departing from the scope of this 

35 invention. 

The polypeptides produced on expression of 
the DNA sequences of this invention are essentially 
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free cf other proteins of human origin. Thus, they 
are different than T4 protein j, irified from human 
lymphocytes . 

The polypeptides of this invention are 
5 useful in immuno therapeutic compositions and methods. 
For example, the polypeptides of this invention are 
active in inhibiting infection by agents whose primary 
targets are T4 + lymphocytes by interfering with their 
interaction with those target lymphocytes. More 

10 preferably, the polypeptides of this invention may 

be employed to saturate the T4 receptor sites of T4- 
targeted infective agents. Thus, they exert anti- 
viral activity by competitive binding with cell 
surface T4 receptor sites. This effect is plainly 

15 of great utility in diseases, such as AIDS, ARC and 
HIV infection. Accordingly, the polypeptides and 
methods of this invention may be used to treat humans 
having AIDS, ARC, HIV infection or antibodies to 
HIV. In addition, these polypeptides and methods 

20 may be used for treating AIDS-like diseases caused 
by retroviruses, such as simian immunodeficiency 
viruses, in mammals, including humans. 

According to one embodiment of this inven- 
tion, antibodies to soluble T4 proteins and polypep- 

25 tides may be used in the treatment, prevention, or 
diagnosis of AIDS, ARC and HIV infection. 

The polypeptides of this invention may 
also be used in combination with other therapeutics 
used in the treatment of AIDS, ARC and HIV infection. 

30 For - example; soluble T4 polypeptides may be used in 
combination with anti-retroviral agents that block 
'reverse transcriptase, such as AZT, HPA-23, phos- 
phono formate , suramin, ribavirin and dideoxyciti- 
dine. Additionally, these polypeptides may be used 

35 with anti -viral agents such as interferons, includ- 
ing alpha interferon, beta interferon and gamma 
interferon, or glucosidase inhibitors, such as 



3674 



-25- 

castanospermine . Such combination therapies advan- 
tageously utilize lower dosages o those agents, 
thus av iding possible toxicity. 

And, the polypeptides of this invention 
5 may be used in plasmapheresis techniques or in blood 
bags for selective removal of viral contaminants 
from blood. According to this embodiment of the 
invention, soluble T4 polypeptides may be coupled to 
a solid support, comprising, for example, plastic or 

10 glass beads, or a filter, which is incorporated into 
a plasmapheresis unit. 

Additionally, the compositions of this 
invention may be employed as immunosuppressants use- 
ful in preventing or treating graft-vs-host disease, 

15 autoimmune diseases and allograft rejection. 

The compositions of this invention typi- 
cally comprise an immunotherapeutic effective amount 
of a polypeptide of this invention and a pharmaceu- 
tically acceptable carrier- Therapeutic methods of 

20 this invention comprise the step of treating patients 
in a pharmaceutic ally acceptable manner with those 
compositions . 

The compositions of this invention for use 
in these therapies may be in a variety of forms. 

25 These include, for example, solid, semi-solid and 

liquid dosage forms, such as tablets, pills, powders, 
liquid solutions or suspensions, liposomes, supposi- 
tories, injectable and inf usable solutions. The 
preferred form depends on the intended mode of admin- 

30 istration and therapeutic application. The composi- 
tions also preferably include conventional pharma- 
ceutical^ acceptable carriers and adjuvants which 
are known to those of skill in the art. 

Generally, the pharmaceutical compositions 

35 of the present invention may be formulated and admin- 
istered using methods and compositions similar to 
those used for other pharmaceutically important poly- 
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peptides (e.g., alpha- interferon ) . Thus, the poly- 
peptides may be st red in lyoph. xized f rm, reconsti- 
tuted with sterile water just prior to administration , 
and administered by the usual routes of administration 
5 such as parenteral, subcutaneous, intravenous, intra- 
muscular or intralesional routes. An effective dosage 
may be in the range of from 0.5 to 5.0 mg/kg body 
weight/day, it being recognized that lower and higher 
doses may also be useful. 

10 This invention also relates to soluble 

receptors and their use in diagnosing or treating 
viral agents which target or bind to those receptors. 
Such soluble receptors may be used as decoys to 
absorb viral agents and to halt the spread of viral 

15 infection- Alternatively, virus-killing agents may 
be attached to the soluble protein receptors, 
providing a direct mode of delivery of those agents 
to the virus. 

More particularly, the polypeptides of 

20 this invention are useful in diagnostic compositions 
and methods to detect or monitor the course of HIV 
infection. Advantageously, these polypeptides are 
useful in diagnosing variants of the HIV virus, 
regardless of origin of the infecting HIV agent. 

25 For example, soluble T4 proteins and poly- 

peptides according to this invention, which have a 
high affinity for HIV, may be advantageously used to 
increase the sensitivity of HIV assay systems now 
based upon monoclonal or polyclonal antibodies . 

30 More specifically, soluble T4 proteins and polypep- 
tides may be used to pretreat test plasma to concen- 
~ urate any HIV present, even in small amounts, so 
that it is more easily recognized by the antibody. 
And soluble T4 proteins and polypeptides may be used 

35 to purify the HIV envelope protein gpl20. 

Alternatively, the soluble T4 proteins and 
polypeptides of this invention may be used to replace 
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anti-HIV antibodies now used in /arious assays. 
These soluble T4 proteins and polypeptides are be 
preferable to anti-HIV antibodies for two reasons. 

First, soluble T4, exhibits an affinity for HIV of 

-9 -7 
5 approximately 10 , a level which exceeds the 10 

to 10" 8 values of anti-HIV antibodies . And, while 

anti-HIV antibodies are more likely to be specific 

for different HIV isolates, strain variations would 

not affect a soluble T4 protein-based assay, since 

10 all HIV isolates must be capable of interacting with 
the T4 receptor as a prerequisite to inf ectivi ty . 

For example, a soluble T4 protein or poly- 
peptide may be linked to an indicator, such as an 
enzyme, and used in an ELISA assay. Here, soluble 

15 T4 advantageously acts as a measure of both HIV in a 
test sample and any free HIV envelope gpl20 protein. 

And, polyvalent forms of soluble T4 prote? . 
or polypeptides may be produced, for example, by 
chemical coupling or genetic fusion techniques, thus 

20 increasing even further the avidity of soluble T4 
for HIV. 

In order that this invention may be better 
understood, the following examples are set forth. 
These examples are for purposes of illustration only, 
25 and are not to be construed as limiting the scope of 
the invention in any manner. 

EXAMPLES 

Purification Of Native Solubilized T4 

We purified native - T4 from the T4 + -promono- 
30 " cytic cell line U937 derived from a histocytic 

lymphoma to approximately 50% purity using immuno- 
affinity chromatography as follows. 

We grew U937 cells [a gift from Dr. Scott 
Hammer, New England Deaconess Hospital] to 
35 10 6 cells/ml in RPMI 1640, 10% FCS, harvested and 

washed them in IX PBS. We then lysed the cell pellet 
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in 20 mM Tris-HCl (pH 7.7), 0.5% NF-40 (a non-i nic 
detergent), 0.2% NaDOC, 0.2 mM EGTA, 0.2 mM PMSF and 
5 pg/ml BPTI at 4 x 10 7 cells/ml. Because this 
purification was carried out in the presence of a 
5 non-ionic detergent, T4, which is normally membrane- 
bound via its hydrophobic transmembrane domain, was 
isolated as a solubilized protein. We spun the lysate 
in a GS 3 rotor for 10 min at 10,000 rpm and stored 
the supernatant at -70°C. 

10 Subsequently, we preabsorbed the clarified 

cell extract with mouse I gG-Sepharose . followed by 
protein A Sepharose and then passed the flow-through 
through an immunoaffinity column comprising immobil- 
ized 19Thy anti-T4 monoclonal antibody on Af figel-10 

15 [a gift from Dr. Ellis Reinherz, Dana Farber Cancer 
Institute, Boston, Massachusetts). We washed the 
column extensively and eluted the bound material 
with 50 mM glycine-HCl (pH 2.5), 0.15 M NaCl, 0.5% 
NP-40, 5 pg/ml BPTI and 0.2 mM EGXA. 

20 We then separated 10 pi aliguots of each 

elution fraction on a 10% SDS-PAGE under reducing 
conditions, with the bands being visualized by silver 
staining. As shown in Figure 1, a major silver- 
stained band of 55 Kd was visible. We theji carried 

25 out two assays on the 55 Kd protein and sequenced 
the amino terminus of the protein to confirm its 
identity as native solubilized T4. 

Sequencinq Of Native Solubilized T4 

----- We-determined- the N- terminal amino acid 

30 sequence of our solubilized native T4 which we 

isolated from a detergent extract of U937 cells by 
immunoaf f inity chromatography as described above. 

Techniques for determining the amino acid 
sequences of various proteins and peptides derived 
35 from them are well known in the art. We chose auto- 
mated Edman degradation to determine the amino 
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terminus f our solul ilized native T4. More speci- 
fically, we gel purified and electrc fluted approxi- 
mately 5 pg of the solubilized native T4 and then 
subjected it to automated Edman degradation using a 
5 gas phase sequencer (Applied Biosystems 470A). We 
then identified the PTH-amino acids produced at each 
cycle of the Edman chemistry by high pressure liquid 
chromatography, on-line with the sequencer , in a 
PTH-amino acid analyzer (Applied Biosystems 120A). 
10 Direct analysis of the protein provided amino terminal 
sequence information which , when compared to the 
amino acid sequence deduced from the cOHA sequence 
of human T4 f Maddon et al. (1985), supra), identified 
the purified protein as human T4. 

15 Radioimmunoassay Of Native Solubilized T4 

To determine that our purification process 
enriched for T4, we assayed fractions from the 
immunoaf f inity elution step in a T4-specific sandwich 
radioimmunoassay, based upon the ELISA assay of P. E. 

20 Rao et al-, in Cellular Immunology , 80, pp. 310-19 
(1983). We coated each well of a Removawell strip 
(Dynatech Labs, Alexandria, Virginia) with 50 pi of 
10 pl/ml OKT4 antibody (ATCC #CRL 8002) or MOPC195 
(a background binding control) in 0.05 M sodium 

25 bicarbonate buffer (pH 9.4) at 4°C overnight. We 

washed the wells and then filled them with 1% FCS in 
PBS to saturate the protein binding capacity of the 
plastic. After removing the 1% FCS solution, we 
added test samples, in 50 pi aliguots, to the wells. 

30 We then incubated the samples for 4 hours at room 
temperature. Subsequently, we removed the samples 
and washed the wells four times with 0.05% Tween-20 

12S 

in PBS. we then added I-labelled 19Thy antibody 
(50,000-100,000 cpm per well) and incubated the wells 
35 at 4°C overnight. We then washed the wells four 
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times and separated eich well for bound x< "i detection 
in a Beckman gamma detect r. 

As shown in Figure 1, in which values were 
plotted following subtraction for background, the 
5 peak fraction of solubilized native T4 protein 

detected by radioimmunoassay coincided with elution 
of the 55 Kd protein seen by silver staining. 

Western Blot Assay For T4 

Although many antibodies have been developed 
10 for detecting T4 antigen, none are useful for protein 
blot analysis (Dr. Ellis Reinherz, personal communi- 
cation ) . In order to develop antibodies useful for 
Western blot detection of soluble T4 to follow the 
purification of T4 and recombinant soluble T4, we 
15 raised polyclonal, hyperimmune anti-T4 antisera in 
rabbits against three synthetic T4 oligopeptides. 
These oligopeptides axe represented in Figure 3 as 
follows : 

Oligopeptide Amino Acid Coordinates 
20 JB-1 44-63 

JB-2 133-156 
JB-3 325-343 
We had previously synthesized these peptides using 
conventional phosphoamide DNA synthesis techniques . 
25 See, e.g.. Tetrahedron Letters , 22, pp. 1859-62 

(1981). We synthesized the peptides on an Applied 
Biosystems 380A DNA Synthesizer and purified them by 
gel electrophoresis. 

(i) Coupling Of T4 Peptides To BTG 

30 * We coupled each of these peptides to the 

carrier protein bovine thyrogobulin ("BTG") [Sigma, 
St. Louis , Missouri] according to a modification of 
procedures set forth in J. Rothbard et al., J. Exp. 
Med. , 160, pp. 208-21 (1984) and R. C. Kennedy et al., 

35 "Antiserum To A Synthetic Peptide Recognizes The 
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HTLV-III Envelope Glycoprotein", Science , 231, 
pp. 1556-59 (1986). 

More specifically, we mixed 10 mg of BTG 
diluted in 1 ml of PBS with 1.3 mg of m-maleimido- 
5 benzoyl -N-hydroxysuccinimide ester ("MBS") in 0.5 ml 
of dimethylformamide ("DMF"). We mixed the reaction 
mixture well and reacted it for about 1 hour at 25°C. 
Subsequently, we loaded the mixture onto a Sephadex 
G25 gel filtration column (Pharmacia, Sweden) which 

10 had been pre-equilibrated with C.l M PBS (pH 6.0). 
We collected a total of thirty 2 ml aliquot 

elution fractions and read the absorbanee of each 
fraction at 280 nm ( "^qq" ) - w « then pooled the 
three peak fractions (15, 16 and 17) to create the 

15 activated carrier. 

We dissolved 10 mg of NaBH 4 in 2.5 ml of 
0.1 M sodium borate solution to produce a sodium 
borohydride solution. Subsequently, we diluted 
approximately 8 mg of each of synthetic T4 peptides 

20 JB-1, JB-2 and JB-3 with 1 ml of 0.1 M borate buffer 

and then mixed each solution with 200 pi of the sodium 
borohydride solution, incubating the mixture on ice 
for 5 minutes. We then warmed each peptide solution 
to 25°C, brought each solution to pH 1.0 with 1 N 

25 HCl (during which frothing occurred) and then brought 
each solution to pH 7.0 with 1 N NaOH (after the 
frothing had stopped). 

We then coupled each peptide to BTG by 
adding 1-2 ml of the peptide solution to 6 ml of the 
~30 activated carrier solution. We allowed the coupling - 
- reaction to proceed overnight by incubating the reac- 
* tion mixture at room temperature . 

( ii ) Inoculation Of Test Animals 

We dissolved each of the BTG-coupled pep- 
35 tides prepared above in sterile Freund's complete 

adjuvant, to a final concentration of 1 ^g/ml coupled 
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peptide in PBS. Subsequently, v*^ inoculated each of 
three rabbits (New Zealand white) by intramuscular 
injection of 500 pg of one of the coupled peptides 
into each rabbit. We inoculated a fourth rabbit 
(New Zealand white) in the same manner with a mixture 
of the three coupled peptides. All rabbits were 
prebled prior to boosting to establish an average 
baseline for each response to be measured. The 
rabbits were boosted at 6 weeks with 500 pg coupled 
peptide in incomplete Freund's adjuvant. 

Serum was collected from each rabbit monthly 
for 4 months after immunization. The serum was then 
assayed for anti pep tide titer. 

(iii) ELISA with Antipeptide Sera 

Aqa^^t Peptide Coated Plates 

In this assay, we determined that antiserum 
raised in an animal by each of peptides JB-1, JB-2 
and JB-3 binds to that peptide. Accordingly* those 
peptides are immunogenic and elicit a response in 
test animals. 

To carry out the assay, we coated Immulon-2 
(Dynatech Labs, Alexandria, Virginia) microti ter 
plates with 50 pi per well of 50 pg/ml uncoupled 
peptide in PBS and incubated the plates overnight at 
4°C. Plates coated with peptide 46R*, which served 
as controls, were treated identically. We then washed 
the plates 4 rimes with PBS-Tween (0.5%) and 4 times 
with water. The plates were blotted dry by gentle 
tapping_over paper towels . After blotting, the plates , 



* Peptide 46 corresponds to amino acids ( " AA" ) 
728-751 of the env gene of the HIV genome. The amino 
acid numbering corresponds to that set forth for the 
env gene in L. Ratner et al . , "Complete Nucleotide 
Sequence Of The AIDS Virus, HTLV-III", Nature , 313, 
pp. 277-84 (1985). Peptide 46 has the sequence: 
LP I P RGPDRPEG I EEEGGERDRDR . 
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we added 200 pi of a 5% FCS/PBS soluti n to each 
well and incubated the plates for 1 hour at r om 
temperature . 

We then assayed serum samples from the 
5 rabbits on the pre-coated plates prepared as described 
above. We assayed the antibody response to the 
immunogen peptide at an initial dilution of 1:100, 
followed by serial 10-fold dilutions in 5% FCS/PBS. 

After a 2 hour incubation period at room 

10 temperature, we washed the platen and blotted them 
dry as described above. We then added 50 til of a 
1:1500 dilution of horseradish peroxidase ("HRP")- 
conjugated goat anti -rabbit- I gG [Cooper Biomedical, 
Malvern, Pennsylvania] in 5% FCS/PBS to each well 

15 and incubated the plates at room temperature for 

1 hour. We washed the plates with PBS-Tveen 0.5%. 
We then added 50 pi of 0.42 mM TMB. We stopped the 
enzyme reactions with 50 pi of 2 M B^SO^. We then 
analyzed the plates spectrophotometrically at 450 run 

20 using a microti ter plate reader [Dynatech Labs, 
Alexandria , Virginia] . 

We observed that antiserum against each of 
peptides JB-1, JB-2 and JB-3 binds to the corre- 
sponding peptide. We also observed that antiserum 

25 against a mixture of peptides JB-1, JB-2 and JB-3 

binds to peptides JB-1 and JB-3 under the conditions 
set forth above. The titers of each of the four 
antisera tested against the peptides in the solid- 
phase ELISA are shown below, where "ND" represents 

_30 values - not -determined: — — 

Approximate Titer Against: 

Peptide JB-1 JB-2 JB-3 

JB-1 >l/50,000 0 ND 

JB-2 0 1/50,000 ND 

35 JB-3 0 0 1/10,000 

JB-1 + JB-2 + JB-3 1/4,000 ND 1/7,000 



3683 



89085519 



PCT/L'S88/U2y40 



-34- 

Ig fractions from two of the three anti- 
peptide sera raised against indi idual peptides, 
anti-JB-1 and anti-JB-2, recognized the 55 Kd T4 
antigen band of native solubilized T4 in a Western 
5 blot analysis of protein eluted from the 19Thy 
(anti-T4) monoclonal antibody affinity column 
described above. As in the case of the radioimmuno- 
assay of native solubilized T4, the detection of the 
55 Kd protein coincides with its apparent elution 

10 from the affinity column. This provides further 

evidence that our T4 purification procedure enriched 
for solubilized T4. 

Thus, these polyclonal sera are useful in 
the detection of nanogram quantities of T4 (both 

15 native and recombinant forms) by Western analysis. 

Binding of Cell-Free T4 To HIV Envelope 

We then tested our purified solubilized 
native T4 isolated from U937 cells for its ability 
to bind to the HIV envelope protein gpl60/gp!20. To 

20 carry out this direct binding assay, we incubated 
35 S-labelled gp!60/gpl20 detergent cell extract 
derived from a recombinant cell line 7d2 (a gift 
from Drs- Mark Kowalski and William Has el tine, Dana- 
Farber Cancer Institute) with samples of solubilized 

25 native T4, each of which had been preincubated with 
one type of monoclonal antibody. 

More specifically, we mixed 5 pi of solu- 
bilized T4 in a micro fuge tube with 5 pg (about 3 pi) 
-of OKT4 (ATCC_#CRL 8002), a monoclonal antibody 

30 recognizing an epitope on T4 which does not interfere 
with HIV binding [J- A. Hoxie et al., J. Immunol. , 
136, pp. 361-63 (1986)] or with 5 pg of OKT4A (Ortho 
Diagnostics #7142), a monoclonal antibody that inter- 
feres with HIV binding to T4 positive cells [J. Steven 

35 McDougal et al., J, Immunol. , 137, pp. 2937-2944 

(1986)]. Alternatively, we mixed 50 pi of solubilized 
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T4 with 5 pg of aHTLV III gp!20 (Dupont #NEN-9284). 
we then incubated the mixtures or* ice for 1 hour. 

Subsequently, we added 150 pi of 35 S- 
labelled gpl60/gpl20 cell extract or 35 S-labelled 
5 control cell extract (precleared with protein- A 

Sepharose) to the preincubated solubilized T4/mono- 
clonal antibody mixtures and rocked the tubes over- 
night at 4°C. we then precipitated the T4/gpl60/gpl20 
immune complexes by adding 30 pi of protein-A 

10 Sepharose to each tube and rockir.g for 2 hours at 

4°C to allow the protein-A Sepharose to bind to the 
antibody complexes. Subsequently, we spun down the 
beads in an Eppendorf microfuge and after extensive 
washings, we eluted with 40 jj1 SDS sample buffer at 

15 65°C for 10 minutes. We then loaded 20 pi of the 

eluted material on a 7.5% SDS-PAGE gel which was run 
under reducing conditions. 

Figure 2 depicts autoradiograph and western 
blot results of the T4/gpl60/gpl20 coimmunoprecipita- 

20 tions. In Figure 2, lanes 1-5 were autoradiographed 
after treatment with 40% sodium salicylate and lanes 
6-7 were developed on a Western blot with rabbit 
antisera JB-2. 

As shown in Figure 2, gp!60/gp!20 protein 

25 was coimmunoprecipitated in the presence of T4 with 
OKT4 (lane 5) but not in the presence of T4 with 
OKT4A (lane 4). Lane 3 shows the positive control 
for gp!60/gpl20 using aHTLV III gpl20 monoclonal 
antibody. Neither negative control with 35 S-labelled 

30 control extract (lane T) or protein-A Sepharose alone 
(lane 2) showed bands migrating in the position of 
gpl60/gpl20. Based upon the bands that developed 
on the Western blot, the amount of T4 precipitated 
with either OKT4 (lane 6) or OKT4A (lane 7) appeared 

35 to be similar. 

This demonstrates that purified, solubilized 
native T4, which is naturally membrane bound, can 
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still interact with the HIV glycoprotein in solution. 
Acc rdingly, we believe that cell free soluble T4 is 
useful in preventing the binding interaction between 
HIV and the T4 receptor of T4 + lymphocytes. By com- 
5 peting with cell surface T4 for binding to the HIV 

envelope protein gpl20, soluble T4 is useful in block- 
ing HIV infection. 

Synthesis Of Oligonucleotide DNA Probes 

The nucleotide sequence and a deduced amino 
10 acid sequence for a cDKA that purportedly encodes 
the entire human T4 protein have been reported 
f Maddon et al- , (1985), supra]. The deduced primary 
structure of the T4 protein reveals that it can be 
divided into domains as demonstrated below: 

15 Amino Acid 

Structure/Proposed Location Coordinates 

Hydrophobic/Secretory Signal -23 to -1 

Homology to V- Regions/ 

Extracellular +1 to +94 

20 Homology to J-Regions/ 

Extracellular +95 to +109 

Glycosylated Region/ 

Extracellular +110 to +374 

Hydrophobic/Transmembrane 
25 Sequence +375 to +395 

Very Hydrophilic/ 

Intracytoplasmic +396 to +435 

Based on the sequence for the above-listed 
domains, we chemically synthesized antisense 

30 oligonucleotide DNA probes using conventional phos- 
p ho amide DNA synthesis techniques. See, e.g.. 
Tetrahedron Letters , 22, pp. 1859-62 (1981). We 
synthesized the probes on an Applied Biosystems 380A 
DNA synthesizer and purified them by gel electro- 

35 phoresis. 
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Furthennore . we synthesized the pr bes 
such that they were c mplementary to the DNA 
sequences which code for the amino acid sequence, 
i.e., the probes were antisense, to enable them to 
5 recognize and hybridize to the corresponding sequences 
in DNA, as well as in mRNA. The nucleotide sequences 
of the eleven selected regions of the T4 protein 
[corresponding to the nucleotide numbering set forth 
in Maddon et al . , (1985), supra) were the following: 



10 Nucleotide 

Oligonucleotide Coordinates 

1 145-171 

2 742-765 

3 1414-1440 
15 6 427-453 

7 1303-1329 

8 1012-1038 

9 97-118 
10 10-36 

20 11 1698-1724 

12 397-423 

14 261-287 



Before us in g our DNA probes for screening, 
we 5* end-labelled each of the single- stranded DNA 
25 probes with 32 P using [y- 32 P]-ATP and T4 polynucleo- 

tide ki nase , substantially as _des cr ibejd„ by _ A.,_M._ Max am 

and W. Gilbert, "A New Method For Sequencing DNA", 
Proc. Natl. Acad. Sci . USA , 74, pp. 560-64 (1977). 

Construction of XgtlO Peripheral Blood 
30 Lymphocytes cDNA Library 

To prepare our Peripheral Blood Lymphocytes 
( PBL ) cDNA library, we processed PBL, from a single 
leukophoresis donor, through one round of absorption 
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to remove monocytes. We then stimulated the non- 
adherent cells with IFN-y 1000 j/ml and 10 vg/ml PHA 
for 24 hours. We isolated RNA from these cells using 
phenol extraction [Maniatis et al . , Molecular Cloning , 
5 p. 187 (Cold Spring Harbor Laboratory) (1982)] and 

prepared poly A* mRNA by one round of oligo dT cellu- 
lose chromatography. We ethanol precipitated the 
RNA, dried it in a speed vac and resuspended the RNA 
in 10 pi (0.5 pg/Ml). We treated the RNA for 10 

10 min at room temper attire in GH^EgOH (5 mM final con- 
centration) and 0 -mercaptoe thanol (0.26 M) . We then 
added the methyl mercury treated RNA to 0.1 M Tris-HCl 
(pH 8-3) at 43°C, 0.01 M Mg, 0.01 M DTT, 2 mM Vanadyl 
complex, 5 pg oligo ^^-is' 20 KC1 » 1 dCTP, 

15 dCTP, dTTP, 0.5 mM dATP, 2 pCi (o- 32 P]dATF and 30 U 

1.5 m! AMV reverse transcriptase (Seikagaku America) 
in a total volume of 50 pi. We incubated the mixture 
for 3 minutes at room temperature and then for 3 hours 
at 44°C, after which time we stopped the reaction by 

20 the addition of 2.5 pi of 0.5 M EDTA. 

We extracted the reaction mixture with an 
equal volume of phenol: chloroform (1:1) and precipi- 
tated the aqueous layer two times with 0.2 volume of 
10 M NH 4 AC and 2.5 volumes EtOH and dried it under 

25 vacuum. The yield of cDNA was 1.5 pg. 

We synthesized the second strand according 
to the methods of Okayama and Berg [ Mol. Cell. Biol. , 
2, p. 161 (1982)] and Gubler and Hoffman [ Gene , 25, 
pp. 263-69 (1983)], except that we used the DNA poly- 

30 merase I large fragment in the synthesis. 

We blunt ended the double-stranded cDNA by 
resuspending the DNA in 80 m1 TA buffer (0.033 M Tris 
Acetate (pH 7.8); 0.066 M KAcetate; 0.01 M MgAcetate; 
0.001M DTT; 50 \*q/ml BSA ) , 5 \iq RNase A, 4 units RNase 

35 H, 50 mM 3 NAD , 8 units E.coli ligase, 0.3125 mM 
dATP, dCTP, dGTP, and dTTP, 12 units T 4 polymerase 
and incubated the reaction mixture for 90 min at 
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37°C, added 1/20 volume of 0.5M EDTA, and extracted 
with phenol: chloroform. We chrcaatographed the 
aqueous layer on a C150 Sephadex column in 0.01M 
Tris-HCl (pH 7.5), 0.1 M NaCl, 0.001 M EDTA and 
5 collected the lead peak containing the double-stranded 
cDNA and ethanol precipitated it. Yield: 0.605 pg 
cDNA. 

We ligated the double-stranded cDNA to 
linker 35/36: 

10 5 • AATTCGAGCTCGAGCGCGGCCGC3 • 

3 1 GCTCGAGCTCGCGCCGGCG5 9 

using standard procedures . We then size selected 
the cDNA for 800 bp and longer fragments on a S500 
Sephacryl column, and ligated it to EcoRI -digested 

15 bacteriophage lambda vector gtlO (a gift of 

Dr. Ellis Reinherz). We packaged aliquots of the 
ligation reaction in Gigapak (Strategene) according 
to the manufacturer's protocol. We used the packaged 
phage to infect E.coli BNN102 cells and plated the 

20 cells for amplification. The resulting library con- 
tained 1.125 x.10 6 independent recombinants. 

We also screened a PBL cDNA library in the 
bacteriophage lambda vector gtlO (a gift of Dr. Ellis 
Reinherz), which was synthesized from mRKA from a 

25 T4 + tumor cell line named REX, which expresses T4 

protein at high levels [O. Acuto et al. # "The Human 
T Cell Receptor: Appearance In Ontogeny And 
Biochemical Relationship Of Lambda and Beta Sub units 
on IL-2 Dependent Clones And T Cell Tumors", Cell , 

30 34, pp. 717-26 (1983)]. ~ 

Screening Of The Libraries 

We then used three of our 32 P-labelled 
synthetic oligonucleotide antisense probes, probes 3, 
6 and 9, to screen in parallel our two XgtlO cDNA 
35 libraries using the plaque hybridization screening 

technique described in R. Cate et al . , "Isolation Of 
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The Bovine And Human Genes For Mullerian Inhibiting 
Substance And Expression Of The I uman Gene In Animal 
Cells", Cell , 45 , pp. 685-98 (1986), with minor 
modifications. We modified the Cate et al. proce- 
5 dure by hybridizing without tetramethyl ammonium 
chloride to accommodate our use of unique probes, 
rather than mixtures, to probe the plague filters. 

We used the three probes, which had been 
previously 5* end-labelled with [*- 32 P]-ATP according 
10 to the method of A. Max am and w. Gilbert, Meth. 

Enzymol . , 68, pp. 499-80 (1979) to screen in parallel 
the PBL cDNA library and the REX cDNA library dis- 
cussed above. 

From our screening of the PBL library, we 

15 isolated a nearly full length soluble T4 cDNA clone 

A203-4 (or AgtlO.PBL.T4) — containing a 3.064 kb 
insert which could be cleaved from the AgtlO vector 
with EcoR I . 

From our screening of the REX cell library, 
20 we isolated an incomplete T4 cDKA clone containing 

a 1,200 bp cDNA insert. We then further characterized 
the DNA from these clones by DNA sequencing analysis. 

w * also screened a bacteriophage lambda 
human genomic library, constructed in the vector 
25 EMBL3 by Dr. Mark Pasek (Biogen Inc., Cambridge, 

Massachusetts) [N. Murray in Lambda 2, eds. R. Hendrix, 
J. Roberts, F. Stahl, R. Weisberg, pp. 3935-422 (1983)] 
The library contains DNA fragments, created by partial 
restriction of chromosomal DNA from the human lympho- 
30 blastid cell line GM1416,48, XXXX (Human Genetic 
Mutant Cell Repository, Camden, New Jersey) with 
Sau3a, ligated onto EMBL3 arms which had been sub- 
jected to cleavage with BamH I according to the pro- 
cedures outlined in Maniatis et al. . (1982), supra. 
35 Plating of the phage library, lysis, and transfer of 
the phage DNA onto nitrocellulose were performed as 
described by w. D. Benton and R. W. David , "Screening 



3690 



S9085519 



WO8V/U1V40 



-41- 

f Lambda gt Recombinant Clones By Hybridization To 
Single Plaques In SiwU", Scienc e, 196, p. 180 (1977) 
and Maniatis et al . (1982). Hybridization conditions 
were those described by Cate et al . (1986), supra. 
5 except that te trame thy 1 ammonium chloride (TMAC1) was 
omitted from the washing buffer. 

Approximately 2 million plaques were 
screened in parallel hybridizations with probe 1 
and probe 3 discussed above. One phage, called 
10 CM47 , which hybridized with probe 3 in the pr imar y 
screenings, was subjected to DNA sequence analysis 
to determine the existence and position of an intron 
between the coding sequences for the predicted extra- 
cellular and transmembrane domains. No phage clones 
15 containing T4 sequences were found screening with 
probe 1. probably because it includes a sequence 
interrupted by an intron [D. R. Littman and S. N. 
Gettner, Nature, 325, pp. 453-55 (1987); and our 
observations] . 
20 Partial sequence analysis of CM47 shows 

that an intron interrupts the sequence corresponding 
to the codon for valine (amino acid 363) of the 
deduced primary sequence for T4 (Figure 3 — in which 
introns are indicated by a solid line). This intron 
25 defines a potential site for introducing a stop codon 
in order to express a soluble form of T4. Another 
intron found within the coding sequence for T4 inter- 
rupts the codon for arginine (amino acid 295) and a 
third intron in CM47 is found between the codons for 
-30 -arginine (amino acid 402 )_ and arginine (amino acid 
403) (Figure 3). 

Sequencing Of cPNA Clones 

We then subcloned Eco RI digested DNA from 
clone A203-4 into animal expression vector pBG312 
35 f R. Cate et al * , supra] to facilitate sequence 

analysis. More specifically, as depicted in Figure 4, 
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we then digested Xgtl0.PBL.T4 w: *Ji EcoRI to excis 
the 3.064 kbp Eco RI - Eco RI fragment containing the 
full length T4 cDNA. This cDNA sequence, including 
the entire coding region for soluble T4 and for full 
5 length T4 was deposited in pl70-2. We used T4 ligase 
to ligate the fragment into animal expression vector 
pBG312 [supra] which had been previously cut with 
EcoR I , to form pBG312.T4 and pl70-2 (Figure 4). We 
then determined the nucleotide sequence of the EcoR I 

10 fragment of pBG312.T4 using Max am Gilbert technology 
[A. M. Max am and W. Gilbert., "A New Method For 
Sequencing DMA", Proc. Natl. Acad. Sci. USA , 74, 
pp. 560-64 (1977)] (see Figure 3, which depicts the 
PBL cDNA sequence in comparison to that reported by 

15 Maddon et al . , (1985), supra). This analysis showed 
that the 3.064 kbp PBL full length complementary DNA 
copy of T4 cDNA contained the coding sequence for 
T4, approximately 200 bp of 5* noncoding sequence 
and approximately 1500 bp of 3* noncoding sequence. 

20 We then cut pBG312.T4 with PstI and removed 

the resulting 3 9 protruding ends with Klenow and 
isolated an approximately 2.5 kbp fragment. We then 
inserted the fragment into the poly linker of pBG312 
(which had been previously restricted at the Sma l 

25 site) to form plasmid pl70-2, which contains the 
full length PBL T4 cDNA sequence (see Figure 3). 

As depicted in Figure 3, the PBL T4 cONA 
contains a nucleotide sequence almost identical to 
the approximately 1*7 00 ^P sequence reported by 

30 Maddon et al . , (1985), supra. The PBL T4 cDNA, how- 
. ever, contains three nucleotide substitutions that, 
in the translation product of this cDNA, would pro- 
duce a protein containing three amino acid substi- 
tutions compared to the sequence reported by Maddon 

35 et al. As shown in Figure 3, these differences are 
at amino acid position 3 , where the asparagine of 
Maddon et al . is replaced with lysine; position 64, 
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where the tryptophan of Maddon t al. is replaced 
with arginine and at position 231, where th phenyl- 
alanine of Maddon et al . is replaced with serine. 
The asparagine reported at position 3 of Maddon 
5 et al. instead of lysine was the result of a sequenc- 
ing error (Dr. Richard Axel, personal communication) . 
The significance of the amino acid replacements at 
positions 64 and 231, which may represent allellic 
polymorphism [T. C. Fuller et al. , Human immunolocry , 
10 9, pp. 89-102 (1982); W. Stohl and H. G. Kunkel, 

Scand . J ■ Immunol . , 20, pp. 273-78 (1984); N. Amino 
et al.. Lancet , 2, pp. 94-95 (1984); and M. Sato 
et al., J. Immunol. , 132, pp. 1071-73 (1984)], is 
not known. 

15 DNA sequence analysis [Max am and Gilbert , 

supra] of the insert in pEClOO of the REX clone sug- 
gests that it represents the product of a splicing 
error, because 5* noncoding sequence appears to have 
been spliced with coding sequence beginning with the 

20 GGT codon for glycine (amino acid 49) (see Figure 3 
and Figure 5). The T4 coding sequence in pEClOO* 
from glycine ( amino acid 49) to isoleuclne (amino 
acid 435) is identical to the sequence of Maddon 
et al . , (1985), supra. 

25 In comparison, our earlier N- terminal pro- 

tein sequence analysis of native T4 protein purified 
from U937 cells shows a T4 expression product with 
aspargine as amino acid 3 . These differences are 
also set forth in Figure 6, which also depicts com- 

30 parisons at corresponding positions of the partial 
.clone from the REX cell line XgtlO library; our 



* We constructed pEClOO by digesting the incomplete 
T4 cDNA clone from the REX library with Eco RI and 
35 isolating the 1,200 bp cDNA insert. We then ligated 
it to pUC12 (Boehringer Mannheim, Indianapolis, 
Indiana) which had been previously cut with Eco RI to 
form pEClOO. 
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genomic clone from a AJ21BL3 library; m use T4 
sequences [Tourvieille et al . , Science , 234, p. 610 
(1986)] and sheep T4 sequences [Classon et al . , 
Immunogenetics , 23, p. 129 (1986)]. 

5 Construction of Soluble T4 Mutants 

We employed the technique of in vitro 

site-directed mutagenesis and restriction fragment 
substitution to modify the T4 cDNA coding sequence 
of pl70-2 in sequential steps to be identical to 

10 that reported by Maddon et al. , (1985), supra. We 
first used oligonucleotide-directed mutagenesis to 
modify the amino acids at positions 3 and 64. Next, 
we employed restriction fragment substitution with a 
fragment including the serine 231 codon of a partial 

15 T4 cDNA isolated from a T4 positive lymphocyte cell 
line [O. Acuto et al.. Cell , 34, pp. 717-26 (1983)] 
library in Xgtll (a gift from Dr. Ellis Reinherz), 
to modify the amino acid at position 231. We then 
truncated our modified T4 cDNA sequence to remove 

20 the coding regions for the transmembrane and intra- 
cytoplasmic domains. Subsequently, we constructed 
three different soluble T4 mutants from our full 
length T4 clone PBL T4 by linker insertion between 
restriction sites in order to increase the probability 

25 of empirically finding a stable, secretable T4 mole- 
cule. The structure of each of these mutants is 
depicted in Figure 7A. 

Line A of Figure 7A represents a hydropathy 
analysis of our full length soluble T4 carried out 

30 using a computer program called Pepplot (University 
of Wisconsin Genetics Computer Croup) according to 
J. Kyte and R. F. Doolittle, J, Mol . Biol. , 157 , 
pp. 105-32 (1982). Line B depicts the protein domain 
structure of full length T4 f Maddon et al . , (1985) 
35 supra] in which represents the secretory signal 

sequence, "V" represents the immunoglobulin-like 
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variable region sequence, "J" represents the immuno- 
globulin-like joining region sequence, "U" represents 
the unique, extracellular regi n sequence, "TM" 
represents the transmembrane sequence and "C" repre- 
5 sents the cytoplasmic region sequence. In line B, 

the transmembrane amino acid sequence and some flank- 
ing sequence is written below the TM domain. Line C 
depicts the protein domain structure of recombinant 
soluble T4 mutants rsT4.1 in pBG377, rsT4.2 in pBG380 

10 and rsT4.3 in pBG381. Line D represents the protein 
domain structure of E.coli rsT4 gene (Met-perfect 
construct) (p!99-7) which is deleted for the T4 
N- terminal signal sequence (S). 

We constructed the first three soluble T4 

15 mutant gene fragments by truncating our full length 
soluble T4 cDNA at positions corresponding to either 
intron/exon boundaries or to protein domain boundaries 
defined by hydropathy analysis predictions. More 
specifically, we introduced synthetic linkers into 

20 the unique Ava l site that is 5' to the transmembrane/ 
extracellular domain boundary to produce an in- frame 
translational stop codon, thus constructing T4 genes 
that lack, the transmembrane and cytoplasmic domains 
of the full length T4 sequence* 

25 For example, mutant rsT4.1 in pBG377 was 

truncated by the insertion of a stop codon following 
amino acid 362, lysine, which corresponds to the 
position of an intron separating the extracellular 
and transmembrane domain exons. The positions both 

30. of_this intron and of the adjacent intron that splits 
the transmembrane and cytoplasmic domains were deter- 
mined by DNA sequence analysis of chromosomal T4 
clones isolated from the A.EMBL3 genomic library 
described above. Although the significance of the 

35 intron positions flanking the T4 transmembrane domain 
is not known, the determination of the genetic struc- 
ture could provide important information for design- 
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ing rsT4 mutants, sir re exons frequently define 
functional domains [W. Gilbert, "Why Genes In Pieces?", 
Nature , 271, p. 501 (1978)]. 

We then constructed mutant rsT4.2 in pBG380 
5 by truncating the T4 cDNA at the boundary of the 
transmembrane and extracellular domains at amino 
acid 374. And, we constructed mutant rsT4.3 in 
pBG381 by truncating the T4 cDNA at amino acid 377, 
three amino acids downstream from the transmembrane/ 

10 extracellular domain boundary and within the trans- 
membrane domain . 

We also employed the technique of oligo- 
nucleotide site directed mutagenesis, according to 
D. Strauss et al., "Active Site Of Triosephosphate 

15 Isomerase: In Vitro Mutagenesis And Characterization 
Of An Altered Enzyme", Proc. Natl. Acad. Sci. USA , 
82, pp. 2272-76 (1985), to construct a fourth soluble 
T4 mutant from our full length T4 clone PBL T4. The 
structure of this mutant is depicted in Figure 7A, 

20 line D, which represents the protein domain structure 
of E.coli rsT4 gene (Met-perfect rsT4.2) construct, 
deposited in pl99-7 # which is deleted for the T4 
N- terminal signal sequence (S). 

We also constructed various other soluble 

25 T4 deletion mutants to determine which smaller 

fragments of the T4 sequence provide a protein which 
binds to HIV. These constructions were based on our 
belief tliat only the amino terminal sequence of T4 
is required for binding to HIV. This belief, in turn, 

30 - was based upon observations that the monoclonal anti- 
body 0KT4A blocks infection of T4 positive cells by 
HIV and it appears to recognize an epitope in the 
amino portion of T4 [ Fuller et al ■ , supra]. Such 
fragments of T4, which lack glycosylation and which 

35 are capable of binding HIV and blocking infection, 

may be produced in E.coli or chemically synthesized. 
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The structure of each of these deletion 
mutants is depicted in Figure 7B- In that figure, 
line A depicts the protein domain structure of full 
length T4 f Maddon et al . , (1985), supra; Figure 7A] . 
5 In line B, the protein structure of recombinant 

soluble T4 mutants are depicted as follows: rsT4.7 
in p203-5, rsT4.7 in pBG392, rsT4.8 in pBG393, rsT4.9 
in pBG394, rsT4.10 in pBG395, rsT4.11 in pBG397, 
rsT4.12 in pBG396, rsT4.111 in pBG215-7, rsT4. 113.1 

10 in pBG211-ll and rsT4.113.2 in pBG2 14-10. 

We constructed soluble T4 derivatives 
p203-5, pBG392, pBG393, pBG394 and pBG396 by trun- 
cating our rsT4.2 gene after the Stu I sites at amino 
acids 183 and 264 of rsT4.2- More specifically, we 

15 constructed derivative rsT4 . 7 in p203-5 and in pBG392 
by truncating the rsT4 . 2 cDNA at amino acid 182. 
And, we constructed each of derivatives rsT4.9 in 
pBG394 and rsT4.12 in pBG396 by truncating the rsT4.2 
cDNA at amino acids 113, and 166, respectively. One 

20 may also construct each of derivatives rsT4.10 in 
pBG395 and rsT4-ll in pBG397 by truncating the 
rsT4.2 cDHA at amino acids 131 and 145, respectively. 

Expression of T4 and Soluble T4 
Polypeptides In Bacterial Cells 

25 The cDNA sequences of this invention can 

be used to transform eukaryotic and prokaryotic host 
cells by techniques well known in the art to produce 
recombinant soluble T4 polypeptides in clinically 
and commercially, useful amounts.. 

30 _ For example, we constructed expression 

- vector pl99-7, as shown in Figure 9A, as follows. 

We preceded the construction depicted in 
Figure 9A by the construction of various intermediate 
plasmids, as depicted in Figures 8A-8D. Those con- 

35 structions were carried out using conventional 
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rec mbinant techniques. The li-kers employed in 
those constructions are set f rth in Figure 10. 

As depicted in Figures 8A and 8B, starting 
with pl70-2, which contains our full length T4 DNA 
5 sequence, coding for T4 characterized by three dif- 
ferent amino acids than that of Maddon et al . , (1985), 
supra, we produced various constructs which direct 
the expression of soluble T4. Some of these con* 
structs are characterized in that one or more of 
10 those amino acid differences have been changed to 
correspond to the respective amino acids of Maddon 
et al. In this figure, as well as in the other 
figures, amino acid changes are reflected by an 
arrow. 

15 Plasmid p!92-6 contains the Met perfect 

rsT4.2 sequence derived by oligonucleotide site- 
directed mutagenesis which removed the entire T4 
N- terminal signal sequence as shown in Figure SC. 
And, to provide a convenient means of transferring 

20 the rsT4.2 Met perfect sequence into E.coli expression 
vectors, the steps described in Figure 8D were carried 
out to produce p!95-8, a plasmid containing the Met 
perfect rsT4.2 sequence flanked by Cla l restriction 
sites. The Cla l - Cla l cassette of pl9S-8 optimizes 

25 the distance between the S* Cla l site and the 
Initiating Met codon. In Figure 8D, ST8 rop" 
is a tetracycline resistance encoding pAT153- 
based plasmid containing the rop" mutation that 
permits high plasmid copy number, a promoter and 

30 rlbosome binding site from bacteriophage gene 32 and 
the gene 32 transcription termination sequence. 

Cleavage of pi 95-8 with Cla l produced the 
fragment used to assemble pl99-7, a construction 
which directs the expression of Met perfect rsT4-2 

35 under the control of the promoter (Figure 9A). 

As the first step, to construct a vector from which 
rsT4.2 expression is under control of the P r promoter. 
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we constructed the vector pl97-l** from pl034 
( plmuGCSF ) ( Figure 9A ) . 

We then cut pl034 with EcoRI and BamHI to 
excise the GCSF cDNA insert and a portion of the 
5 phage mu ribosome binding site sequence — which we 
subsequently reconstructed with oligonucleotides. 
The synthetic linkers used were linkers 57-60 
( Figure 10 ) • 

We then ligated the synthetic linker into 

10 the EcoR I /BamH I -cut pl034 to form pl97-12. One could, 
instead, replace these steps by starting with any 
suitable E.coli expression vector containing a cla l 
site appropriately placed between the promoter and 
terminator sequences. We cut p!97-12 with Cla l and 

15 inserted a Clal -Clal cassette containing the cDNA 

sequence of rsT4.3 in pBC381 and phage transcription 
terminator derived from p!034. The sequence of this 
cassette is depicted in Figure 11. The resulting 
plasmid, pl99-7. contains the rsT4.2 *Met perfect" 

20 gene in that vector. 

Alternatively, one could derive the Met 
perfect rsT4.2 sequence from plasmid pBG380, 
deposited in connection with this application, and • 
gap out the signal sequence to create pl92-6. 

25 We tested for expression of pl99-7 as 

follows. SG936, an E.coli Ion htpr double mutant 
[ATCC 39624] [S. Goff and A. Goldberg, "ATP-Dependent 
Protein Degradation In E.coli " , in Maximizing Gene 
- Expression , _W. Reznikof f and L Gold ( eds * ) ( 1986 ) ] , 

30 .was transformed with pl99-7 by conventional proce- 
dures f Maniatis et al. (1982)] to form SG936/pl99-7 , 
a transfonnant containing a plasmid with the Met- 
perfect rsT4.2 gene behind the P L promoter. Trans- 
formants were selected on LB agar plates containing 

35 10 mcg/ml tetracycline (tet). After streaking out 

several single colonies for single colony isolation, 
ne was chosen at random for testing induction f 
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rsT4.2 synthesis. We picked a single colony from an 
LB- agar tet + plate into 20 ml L..ria Broth (LB) and 
10 mcg/ml tet in a 125 ml shake flask and grew it 
overnight in a shaking air incubator (New Brunswick 
5 Scientific, New Jersey) at 30°C. 

We then initiated an induction culture by 
adding 0.5 ml of the overnight culture to 50 ml LB 
and tet in a 500 ml flask which was grown at 30°C in 
a shaking air incubator. When the culture reached 

10 an OD(600) of 0.4, we transferred it to a 42°C water- 
bath and shook it gently for approximately 20 minutes. 
After heat induction at 42 °C, the flask was trans- 
ferred to a 39°C air incubator (New Brunswick 
Scientific, New Jersey) where it was shaken vigorously 

IS at 250 rpm. We withdrew samples just after the 42°C 
heat shock, and at hourly time points for 4 hours, 
and +y*~r\ after overnight growth. The samples were 
measured for growth by OD(600) and analyzed following 
SDS-PAGE for the pattern of protein synthesis by 

20 Coomassie blue protein staining and by Western blot 
analysis with our rabbit antipeptide antibody probes 
(described above). Based on the relative molecular 
weight and protein blot analysis, the expression of 
rsT4.2 was induced from SG936/pl99-7 following heat 

25 induction at 42°C (Figure 12). 

We transformed p!99-7 into a P^mu.tet 
expression vector, an E.coli expression vector, at 
the unique Clal site (see Figure 11). The nucleo- 
tide and amino acid sequences of pl99-7 are shown in 

_30 Figure 11. 

The expression of soluble T4 from pl99-7 
-in E.coli was measured by Western blot analysis of 
whole cell extracts following SDS-PAGE using the 
rabbit polyclonal anti-peptide JB-1 or anti-peptide 

35 JB-2 antibodies as probes (Figure 12). 

We also constructed expression vector 
p203-5, as shown in Figure 9B, as follows. 
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We started with pl97-7, which has the same 
sequence as the P L um vector pl97-l_; (see Figure 9A), 
except that there is a single nucleotide deletion in 
the 5' noncoding region following the P L promoter. 
5 That deletion , which is a deletion of nucleotide 

#40 — adenine — of pl97-12 (see Figure 11 ), resulted 
from a deletion in the region that was constructed 
from linkers 57-60 (see Figure 10). pl97-7 contains 
the rsT4.2 gene comprising 374 amino acids. Alter - 
10 natively, one could also use pl97-7 as a starting 
plasmld. 

We cut pl97-7 with Cla l . We also cut pl95-8 
(see Figures 8D and 9A) with Cla l to remove the 
Cla l - Cla l cassette containing the cDNA sequence of 

15 rsT4 . 2 - Subsequently, we inserted the Clal -Clal 
cassette into pl97-7 to produce p!98-2. 

We then digested p!98-2 with Stu I to 
remove 80 amino acids ( amino acid 185 to amino acid 
264) of the mature T4 protein coding sequence. Unex- 

20 pected methyl a tl on, however, prevented cutting at 

the second Stu I site, so that only the Stu I site at 
amino acid 184 was cleaved. Following ligation, the 
plasmld DNA was transformed Into E.coli and we 
examined several plasmld clones for the deletion 

25 using standard procedures. None of those plasmids 
contained the expected Stu I deletion. 

Subsequent DNA sequence analysis of one 
of these plasmids, called p203-5, showed that two 
guanine residues (see amino acids 183 and 184; 

-30 -nucleotides 818 and -819 of Figure 3) of -the Stu I .... 
recognition sequence had been deleted following 
cleavage due to exonuclease digestion caused by the 
use of exonuclease-contaminated Stu I enzyme. This 
dinucleotide deletion produced a translation frame- 

35 shift following amino acid 182 (glutamine) and intro- 
duced a stop codon six amino acid codons downstream 
from the frameshift (Figure 9C). The unexpected 
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methylation of the second StuI site together with 
the deletion that resulted in a *ew stop codon 
produced a gene encoding a shortened form of recom- 
binant soluble T4, called rsT4.7. The rsT4.7 sequence 
5 encodes a 182 amino acid N-terminal segment of the 
mature T4 sequence followed by # at the C-tenninus, 
six amino acids ~ asparagine-leucine— glutamine- 
histidine— serine— leucine — of non-T4 sequence and 
finally by a TAA stop codon. 
10 The expression of soluble T4 from p203-5 

in E , coli was measured by Western blot analysis 
as previously described. 

Expression of T4 and Soluble T4 
Polypeptides In Animal Cells 

15 We inserted both soluble T4 genes and the 

unmodified gene encoding membrane— bound T4 into 
animal expression vector pBG368. More specifically, 
we inserted each of the soluble gene constructs into 
pBG368 under the transcriptional control of the 

20 adenovirus late promoter, to give plasmids pBG377, 
pBG380 and pBG381. We also made two pBG3 12 -based 
constructions , called pBG378 and pBG379, which 
direct the expression of recombinant full length T4 
protein. pBG378 and pBG379 code for the same full 

25 length T4 protein but in pBG379, a portion of the 3 • 

untranslated sequence has been removed. Subsequently, 
to test for expression of recombinant soluble T4 and 
recombinant full length T4, we cotransfected Chinese 
_ hamster ovary ("CHO") cells with one of each of 

30 those plasmids and with the pi as mid pAdD26 . 

We first constructed pBG368 as follows . 
As depicted in Figure 13, we cut animal cell expres- 
sion vector pBG312 (R. Cate et al., "Isolation Of 
The Bovine And Human Genes For Mullerian Inhibiting 

35 Substance And Expression Of The Human Gene In Animal 
Cells", Cell , 45, pp. 685-98 (1986)] with Eco RI and 
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Bql ll to delete one of each of t *e two Eco RI and the 
two Bgl l I restriction sites (the Eco *U site at posi- 
tion 0 and the Bal l I site located at approximately 
position 99). The resulting plasmid, pBG368, retained 
an Eco RI site in the cloning region and a Bgl l I site 
after the cloning region. This left a single Eco RI 
site and a single Bgl l I site in the polylinker for 
cloning purposes. 

More specifically, we deleted one EcoR I 
site and one Bal l I site by sequential partial diges- 
tion of pBG312 with restriction enzymes EcoR I and 
Bgl l I, respectively. We filled in with Klenow and 4 
nucleotides then religated to produce pBG368, which 
contains unique restriction sites for EcoR I and Bgl l I 
enzymes* 

Once transient expression of soluble T4 
was verified, we constructed stable cell lines that 
continuously expressed soluble T4. To do this, we 
employed the stable cell expression host, the 
dihydrofolate reductase deletion mutant (DHFR~) 
Chinese hamster ovary cell line [F. Kao et al., 
"Genetics Of Somatic Mammalian Cells X Complementation 
Analysis of Glycine-Requiring Mutants", Proc . Natl - 
Acad . Sci . , 64, pp. 1284-91 (1969); L. Chasin and 
G. Uriah "Isolation Of Chinese Hamster Cell Mutants 
Deficient In Dihydrofolate Reductase Activity", 
Proc. Natl. Acad. Sci. , 77, pp. 4216-80 (1980)]. 

Using this system, we cotransfected each 
T4 gene construct with pAdD26 [R. J. Kaufman and 
P. A. Sharp, "Amplification And Expression Of ~ ~ 
Sequences Cotransfected With a Modular Dihydrofolate 
Reductase Complementary DNA Gene", J. Mol ■ Biol , , 
159, pp. 661-21 (1982) containing the mouse DHFR 
gene. Before carrying out the co- trans fections , we 
linearized all plasmids by restriction enzyme cleavage 
and, prior to transfection, we mixed each plasmid 
with pAdD26 so that the molar ratio of pAdD26 to T4 
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was 1:10. This maxim. zed the number of T4 gene copies 
per trans fectant. 

within the cell, the plasmids were ligated 
together to form polymers that can become integrated 
5 into host chromosomal sequences by illegitimate 

recombination [J. Haynes and C . Weissmann, "Constitu- 
tive, Long-Term Production Of Human Interferons By 
Hamster Cells Containing Multiple Copies Of a Cloned 
Interferon Gene", Nucl . Acids Res . , 11, pp. 687-706 

10 (1983); S. J. Scahill et al., "Expression And Charac- 
terization Of The Product Of A Human Immune Interferon 
cDNA Gene In Chinese Hamster Ovary Cells", Proc . Natl . 
Acad. Sci. USA , 80, pp. 4654-58 (1983)]. We selected 
trans f ectants that express the mouse DHFR gene in 

15 culture medium lacfcing nucleotides, we then subjected 
these trans f ectants to a series of increasing concen- 
trations of methotrexate , a toxic folate analogue 
that binds DHFR, to select for cells levels of DHFR. 

Resistance to methotrexate by increased 

20 expression of DHFR is frequently the result of DHFR 

gene amplification, which can include the reiteration 
of large chromosomal segments, called amplified 
units [R. J. Kaufman and P. A. Sharp, "Amplification 
And Expression Of Loss Of Dihydro folate Reductase 

25 Genes In A Chinese Hamster Ovary Cell Line", Molec. 
Cell. Biol. , 1, pp. 1069-76 (1981)]. Therefore, 
cointegration of DHFR and rsT4 sequences permitted 
the amplification of rsT4 genes. Stably transfected 
cell lines were isolated by cloning in selective 

30 growth medium, then screened for T4 expression with 

a T4 antigen (RIA) [D. Klatzmann et al . , Nature , 

312, pp. 767-68 (1984)] and by immunoprecipitation 

35 

from conditioned medium after [ S] cysteine 
(" 35 S-Cys" ) metabolic labelling. 
35 We also inserted the soluble T4 derivative 

rsT4.7 gene into an animal cell expression plasmid 
as follows - 
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As set fortih in Figure 14C, we cut plasmid 
pBG381 (Figure 14A) with EcoRI ai.d Nhel. we then cut 
pl86-6 with Eco RI and Nhe l to remove the 786 base 
pair fragment. We ligated that fragment into the 
5 digested pBG381 to form plasmid pBG391. The T4 
sequence in pBG391 is identical to both that of 
Maddon et al. (1985) supra at positions 64 (tryptophan) 
aind 231 (phenylalanine) and to that of pBG381. How- 
ever, at position 3, the asparagine reported by 
10 Maddon et al. and present in pBG381 is replaced with 

lysine. The nucleotide sequence of pBG391 is depicted 
in Figure 15. 

We then digested p203-5 with Nhe l and 
OxanI to remove the 483 base pair fragment. We 
15 inserted that fragment into Nhel /OxanI -digested 
PBG391 to form plasmid pBG392, the animal cell 
expression construct of rsT4 . 7 . The T4 sequence in 
rsT4.7 contains amino acids identical to that of 
Maddon et al . ' s full length sequence at amino acid 
20 positions 64 (tryptophan) and 231 (phenylalanine). 
However, at position 3, the asparagine reported by 
Maddon et al. is replaced with lysine. The nucleo- 
tide sequence of pBG392 is depicted in Figure 16. 

In Figure 14D, we have depicted the con- 
25 struction of other a nim al cell expression constructs 
containing sequences encoding the deletions rsT*.9 
in pBC3394, and rsT4.12 in pBG396. Those constructions 
were carried out using conventional recombinant tech- 
niques. The linkers employed in those constructions 
30 are" set forth in Figure 18. The nucleotide sequences 
of pBG394 and pBG396 are shown in Figures 19 and 20. 

Plasmid pBG393, shown in Figure 17 # con- 
tains rsT4.8, the perfect form of rsT4.7. pBG393 
contains 182 amino acids of the mature T4 sequence, 
35 without the additional non-T4 6 amino acids at the 

C-terminus following amino acid 182. The nucleotide 
2jq ^sequence of BG393 is shown in Figure 21. 
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Other animal cell expression plasmids 
according to this invention may oe constructed as 
depicted in Figure 17. These include rsT4.10 in 
PBG395 and rsT4.11 in pBG397 (see Figure 18 for 
5 specific linkers ) . 

The nucleotide sequence of BG395 is shown 
in Figure 22. 

Purification Of Recombinant Soluble T4 

Reco mbinan t soluble T4 construct pBG380 
10 expressed in DHFR~ CHO cells was grown to confluency 
in a a -Modified Eagles Medium (Gibco) supplemented 
with 10% fetal calf serum, 1 mM glutamine and the 
antibiotics penicillin and streptomycin (100 pg/ml 
of each). The cells were grown at 37°C in two 21 Cell 

15 Factory Systems (Nunc). We then washed the confluent 
cells free of fetal calf serum with a -Modified Eagles 
Medium without fetal calf serum and cultured the 
cells in a -Modified Eagles Medium at 37°C for 4 days. 
Subsequently, we harvested the conditioned media, 

20 filtered it through a Millipore Minidisk 0.22p 

hydrophilic filter cartridge (Millipore #MCGL 305-01) 
and concentrated the secreted proteins on a fast-S 
ion exchange column (S-Sepharose Fast Flow, Pharmacia 
#17-0511-01) in 20 mM MES buffer (pH 5.5). 

25 We then eluted the bound proteins with 20 mM 

Tris-HCl (pH 7.7) and 0.3 M NaCl. The elution pool 
was subsequently diluted with 2 volumes of 20 mM 
Tris-HCl (pH 7.7) and it was then loaded on a column 
comprising immobilized 19Thy anti-T4 monoclonal anti- 

30 body coupled to Affigel-10 [a gift of Dr. Ellis 
Reinherz, Dana Farber Cancer Institute, Boston, 
Massachusetts]. We washed the column extensively 
and eluted the bound material as 0.5 ml fractions 
with 50 mM glycine-HCl (pH 2.5), 150 mM NaCl, 0.1 mM 

35 EGTA and 5 pg/ml bovine pancreatic trypsin inhibitor, 
Aprotinin (Sigma #A1153 ) . We used Western blots 
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developed with' rabbit: antisera raised against: peptide 
JB-2 t follow the purificati n. We employed silver 
stained gels to follow binding and elution of rsT4.2 
during the chromatography. Figure 23 depicts a 
5 Coomassie stained gel of purified rsT4.2. 

Gel sizing-column chromatography analysis 
of the purified rsT4.2 from the pBG380 trans fected 
CHO cell line, BG380G, suggests that rsT4 is monmeric 
under physiologic pH and salt concentration. 

10 Sequencing Of Recombinant 
Soluble T4 Protein 

We then determined the N-terminal amino 
acid sequence of a recombinant soluble T4, specifi- 
cally rsT4.2, molecule purified from the conditioned 

15 medium of the pBG380 trans fected CHO cell line BG80G, 
as described above, by automated Edman degradation 
in an Applied Biosy stems 470A gas phase sequenator 
[R. B. Pepinsky et al., J. Biol Chem. , 261, 
pp. 4239-46 (1986)]. 

20 The amino terminal sequence matched the 

sequence which we had previously determined for 
solubilized native T4 isolated from U937 cells, supra , 
The amino terminal sequences of native solubilized 
T4 (sT4) and purified rsT4 protein are A2 proteins, 

25 as compared to the amino terminal sequence predicted 

by Maddon et al., (1985), supra, with the mature amino 
ter m i n us located at position 3 of that sequence. The 
amino terminal sequences of solubilized native T4 

( sT4 ) 7 recombinant soluble-T4- ( rsT4 72 ) secreted by 

30 CHO trans f ectant BG380G containing pBG380 and the 
protein sequence deduced by Maddon et al . (1985), 
supra are as follows : 

ST4 : X-K-V-V-L-X-K-K-X-D-T-V-E-L-T-X-T-A-S-E- 
rsT4 . 2 : N-K-V-V-L-G-K-K-G-D-T-V-E-L-T-X-T-A-S-E- 
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Maddon 

et al. Q-G-N-K-V -V-L-G-K-K-n-D-T-V-E-L-T-C-T-A-S-E 



In the above sequences, the amino acids 
are represented by single letter codes as follows: 



Phe: 


F 


Leu: 


L 


He: 


I 


Met: 


M 


Val: 


V 


Ser : 


S 


Pro: 


p 


Thr: 


T 


Ala: 


A 


Tyr: 


Y 


His: 


H 


Gin: 


Q 


Asn: 


N 


Lys: 


K 


Asp: 


D 


Glu: 


E 


Cys: 


C 


Trp : 


W 


Arg: 


R 


Gly: 


G 



10 X: not determined or ambiguous. 

We also constructed pBG211-ll, a plasmid 
coding for the N- terminal 113 amino acids of soluble 
T4 protein. This construct, which codes for a pro- 
tein characterized by a single disulfide bridge, 
15 between the cysteines at amino acid positions 18 and 
86, is conveniently expressed in E.coli . 

To construct p211-ll, as depicted in Fig- 
ure 24, we first cut pl95-8 (see Figures 8D and 9A) 
with Clal to remove the Clal-Clal cassette contain- 
20 ing the cDNA sequence of rsT4.2. We then digested 
pAX153Tf3SH16AAiiip, the tryptophan operon promoter 
plasmid from the gamma interferon producing E.coli 
strain BN374 with Cla l, and deleted the cDNA coding 
for gamma interferon. Subsequently, we inserted 
25 the Clal -Clal cassette into the Cla l -cut E.coli 

plasmid in front of the tryptophan operon promoter 
and' li gated to produce pl96-10. 

As shown in Figure 25, we then subjected 

_ pBG380 _to oligonuclej)tide-directed mutagenesis to 

30 . insert three tandem translational stop codons follow- 
- ing the T4 cDNA sequence coding for amino acids -23 
to 113 in pBG380, uo produce pBG394. 

We then constructed p211-ll from fragments 
of each of pl96-10, pBG394 and pl034 as depicted in 
35 Figure 26. The first fragment including the vector 
sequences, was produced by restricting pl96-10 with 
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Hind i I I and Clal to remove the 14 c ding sequence 
from amino acids 61 through 374 of rsT4.2 and includ- 
ing vector sequence following the 3* end of the rsT4 
gene. The second fragment, a Hind i 1 1 - Bglll segment 
5 including the codons for T4 amino acids 61-113 of 
rsT4.9 immediately followed by a triplet of stop 
codons in tandem, was isolated by Hind i I I /Bql 1 1 diges- 
tion of pBG394. The third fragment, a BamHI - cla l 
fragment containing a bacteriophage T4 transcriptional 

10 termination signal [H. N. Kirsch and B. Allet, 

"Nucleotide Sequences Involved In Bacteriophage T4 
Gene 32 Translational Self-Regulation", Proc. Natl. 
Acad. Sci- USA , 79, pp. 4937-41 (1982)], was isolated 
by BamH I /CI a I digestion of pl034. We then li gated 

15 these three fragments to produce p211-ll, a T4 con- 
struct coding for a 113 amino acid soluble form of 
T4 protein, with asparagine at amino acid position 3 
(i.e., rsT4. 113.1). 

We then subjected p211-ll to oligonucleo- 

20 tide site-directed mutagenesis (Figure 27) to change 
the amino acid at position 3 from asparagine to 
lysine using the oligonucleotide T4-66: 

ii 

5* ATG CAG GGT AAA 

I I 

AAA GTA GTA CTG 
GGC 3* . 

This produced plasmid p214-10, a fully 

corrected 113 amino acid soluble T4 vector coding 
"for a 113 amino acid soluble form of T4 protein, 
with lysine at amino acid position 3 (i.e., 
rsT4. 113.2). As shown in Figure 27, we subjected 
p214-10 to oligonucleotide site-directed mutagene- 
sis to delete glutamine and glycine at, respectively, 



30 
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amino acid p sitions 1 and 2 of the T4 sequence using 
the oligonucleotide T4AID-87: 

C 



i 



GTA TCC ATT TGG 
ATG ATG AAA AAA 
GTA GTA 3 1 . 



This produced p215-7, a 111 amino acid 
soluble T4 construct, including the trp promoter, 
which directs the expression of a 111 amino acid 

10 soluble form of T4 protein, with lysine at amino 
acid position 3 (i.e., rsT4.111). 

We next constructed p218-8, a 111 amino 
acid construct which directs the expression of a 111 
amino acid soluble form of T4 protein, with lysine 

15 at amino acid position 3 (i.e., rsT4.111) under the 

control of the P L promoter, as depicted in Figure 28. 

More specifically, we cut pl97-12 (Figure 
9A) with Clal to remove the 101 bp fragment contain- 
ing linker and terminator sequences. We also cut 

20 p215-7 with Cla l to remove the Cla l - Cla l cassette 
containing the cDNA sequence of rsT4.111 and the +T4 
transcriptional terminator sequence f Kirsch and Allet , 
supra]. Subsequently, we inserted the Cla l - Cla l 
cassette into the Clal -cut pl97-12 to produce p218-8. 

25 In order to express rsT4. 113.1, we trans- 

formed E.coli A89 with p211-ll by conventional 
techniques [Maniatis et al. (1982), supra] to form 
E.coli A89/p211-ll. - E.coli A89 is a tetracycline - 
sensitive derivative of E.coli SG936. We isolated 

30 E.coli A89 from E.coli SG936 according to the method 
of S. R. Maloy and W. D. Nunn, "Selection For Loss 
Of Tetracycline Resistance By Escherichia coli " , 
J. Bact. , 145, pp. 110-12 (1981), which is based 
upon the ability of the lipophilic chelating agent 

35 fusaric acid to selectively inhibit resistant strains. 
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More specifically, w:_ plated E.c oli SG936 on medium 
c ntaining, per liter, 5 g tryptone. 5 g yeast extract, 
10 g NaCl, 10 g NaH 2 P0 4 *H 2 0, 50 mg chlortetracycline- 
HCl, 12 mg fusaric acid, 0.1 mM 2nCl 2 and 15 g agar. 
5 Colonies which grew at 30°C (putative tetracycline- 
sensitive strains ) were retested for tetracycline 
sensitivity on L-agar plates containing 5 pg/ml 
tetracycline. One tetracycline-sensitive strain, 
designated A89 # was then shown to be unable to grow 

10 on LB agar at 42 °C, thus verifying the presence of 
the htpR mutation. 

Trans form ants were selected by tetracycline 
resistance. We picked a single colony into 20 ml of 
minimal medium plus 0.2% c as amino acids plus trypto- 

15 phan (100 pg/ml) plus tetracycline (10 pg/ml) in a 

100 ml shake flask placed in a shaking air incubator 
at 30°C and allowed the cells to grow up overnight. 
The following morning, we inoculated 40 ml of mini- 
mal medium plus 0.2% cas amino acids plus tryptophan 

20 (100 pg/ml) plus tetracycline (10 pg/ml) with the 

overnight culture at OD ft00 = 0.05 in a 500 ml flask. 
The cells were grown to midlog phase and then induced 
by pelleting, washing once in minimal medium and 
then resuspending in minimal medium plus 0.2% cas- 

25 amino acids plus tetracycline (10 pg/ml), in the 
absence of tryptophan . We removed 0.6 OD 60Q of 
cells after 0, 1, 2, 3 and 4 hours incubation and 
after growth overnight. 

The aliquots were centrifuged and cell 

30 pellets were sub j ected to lysis by boiling in " 
Laemmli gel loading buffer. After centrifugation to 
remove cell debris, half o^ each sample was subjected 
to SDS-PAGE, followed by Western blot analysis with 
our rabbit antipeptide antibody probes or by Coomassie 

35 blue protein staining (Figures 29A and 29B). 
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Purification Of rsT4. 113,1 

We then purified rsT4. 113.1 from the E.coli 
transformant by means of two essentially quantitative 
steps involving anion-exchange and gel- filtration 
5 chromatographies performed under reducing and dena- 
turing conditions. 

More specifically, we suspended 14 g of 
wet cells from a 4 L shake- flask: fermentation in 
100 ml of a 20mM Tris (pE 7.5) buffer containing 

10 20 pg/ml DNase, 20 pg/ml RNase and 1 mM phenylmethyl- 
sxilfonyl fluoride ( "PMSF* ) . The suspension was applied 
to a French Press at 1000 psi in two passages and 
f h^n centrifuged in an SA 600 rotor at 18*000 g for 
15 min at 4°C. The resulting pellet was solubilized 

15 in 20 ml of a 20 mM Tris (pH 7.5) buffer containing 
7 M urea and 10 mM 2 -mere aptoethanol . We then sub- 
jected the suspension to ultracentrifugation at 
85,000 g for 90 min at 4°C. The supernatant was 
diluted by the addition of 80 ml of 20 mM Tris 

20 (pH 7.5) buffer containing 7 M urea and 10 mM 
2 -mer cap toe thano 1 and 40 ml of the sample was 
applied to a 3 x 4 cm Q-Sepharose fast-flow column 
(Sigma, St. Louis, Missouri) which had been pre- 
equilibrated in the same buffer . The column was 

25 developed with a gradient in 400 ml total volume of 

increasing NaCl from 0 to 0.3 M in the same Tris /urea/ 
2 -mere aptoethanol buffer. Column fractions were 
monitored for absorbance at 280 nm and for protein 
content by SDS-PAGE (15% acrylamide). The fractions 

30 were also analyzed by Western blots. Figure 30, 

. panel (a) is a chromatogram displaying the purifi- 
cation of rsT4. 113.1 by ion-exchange chromatography. 
In that figure, peaks containing rsT4. 113.1 are 
identified. The rsT4. 113.1 was found to elute early 

35 in the NaCl gradient and to be well-resolved from 
low-molecular weight contaminants. 
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In order to separate rsT4. 113.1 from high- 
molecular weight contaminants, we carried out gel- 
filtration chromatography on an rsT4. 113 .1 -containing 
pool for final purification of the protein to near 
5 homogeneity (>95% purity). More specifically, we 

prepared a pool containing 20 mg of protein in SO ml 
and then concentrated to 10 ml in a stirred-cell 
ultrafiltration unit (Ami con, Danvers, MA. ) using a 
PM-30 membrane (Ami con). Subsequently, 5.0 ml of 

10 the concentrate was applied to a 1.5 x 95 cm S-300 
column (Sigma) equilibrated and developed in the 
same Tris/urea/2-mercaptoethanol buffer- We moni- 
tored the column fractions for absorb ance at 280 run 
and for protein content by SDS-PAGE. The fractions 

15 were also analyzed by Western blots. A pool con- 
taining rsT4. 113.1 (approximately 4 mg) in 15 ml was 
thus prepared. Figure 30, panel (b) is a chroma to- 
gram displaying the purification of rsT4. 113.1 by 
gel-filtration separation of the rsT4. 113.1 pool. 

20 In that figure, peaks containing rsT4. 113.1 are 
identified. 

Figure 30, panel (c) is an SDS-PAGE analysis 
depicting the purification of the rsT4 derivative 
throughout the centrifugation and chromatography 
25 steps. In Figure 30, panel (c), the lanes depicted 
are: 

. lane A: molecular weight standards 
lane B: cell extracts 

lane C: cell pellet following solubilization 
30 of cell extract in non-denaturing 

conditions 

lane D: supernatant following solubilization 
of cell extract in non-denaturing 
buffer 

35 lane E: supernatant following ultracentri- 

fugation step 
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lane F: Q-Sepharose pool 

lane G: S-300 gel- filtration pool. 

Refolding Of Purified rsT4. 113.1 

We refolded the purified rsT4. 113,1 by 
5 dilution and dialysis steps to non-denaturing and 
oxidized conditions. More specifically, refolding 
of the protein at a concentration of 0.5 OO (280) /ml 
was achieved by stepwise dialysis against 500 volumes 
of 3 M urea, 20 dM Tris (pH 7.5); 500 volumes of 1 M 

10 urea, 0.1 M ammonium acetate (pH 6.8) and, finally, 
the same volume of a phosphate-buffered saline solu- 
tion. Throughout the refolding procedure, samples 
of the protein were monitored for relative content 
by spectral analysis and by high-performance liquid 

15 chromatography ( "HPLC" ) performed on a 150A liquid 
chromatographic system (Applied Bi ©systems. Inc., 
Foster City, California). An octasilyl column 
(Aquapore RP-300, 0.46 x 3.0 cm) was equilibrated in 
80% 0.1% trifluoroacetic acid ( "TFA" )/water (sol- 

20 vent A) and 20% 0.085% TFA/70% acetonitrile (sol- 
vent B) and developed with a linear gradient of 
increasing acetonitrile concentration from 20% to 
80% (solvent B) over 45 min at a flow rate of 
0.5 ml/mi n . 

25 As shown in Figure 31, panel (a), protein 

in 7 M urea, 10 mM 2-mercaptoethanol and 20 mM 
Tris(pH 7.5) eluted from the HPLC column at 49% 
acetonitrile in_ the gradient. In subsequent steps, _ _ 
- from 1 M urea/1 mM ammonium acetate (pH 6.8) [Fig- 

30 *ure 31, panel (b)] to phosphate buffered saline 

[Figure 31, panel (c)], an increasing percentage of 
rsT4. 113.1 was found to elute earlier in the HPLC 
gradient — at 47% acetonitrile. The identity of 
the earlier eluting peak as oxidized product was 

35 verified by reduction of rsT4. 113.1 in non-chaotropic 
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soluti ns and application of sample thus treated to 
HPLC under the same conditions. 

The elution of oxidized rsT4. 113.1 prior to 
reduced protein on HPLC suggests that formation of 
5 the single disulfide bridge decreases relative hydro - 
phobicity of the protein [J. L. Browing et al.. Anal . 
Biochem. , 155, pp. 123-28 (1986)]. Spectral analysis 
of rsT4. 113.1 was performed throughout the course of 
refolding in order to monitor relative yield of solu- 
10 ble protein in the procedure. The refolding method 
allowed approximately 20% recovery of rsT4. 113.1. 
HPLC analysis indicated a less than 15% contaminant 
of reduced protein in the preparation (Figure 30 , 
panel (c), lane G). 

15 Sequencing Of Renatured rsT4.113 

We then carried out amino acid analysis of 
rsT4. 113.1 by automated Edman degradation in an 
Applied Biosys terns 470A gas phase sequenator equipped 
with a 900 A data system. Phenyl thiohydantion amino 

20 acids generated during the course of the degradative 
chemistry were analyzed on-line using an Applied 
Biosystems 120A PTE-analyzer equipped with a PTH-C18 
2.1 x 220 mm column. Protein (10 pg) for sequence 
analysis was applied to SDS-PAGE (15% aery 1 ami de) 

25 and elecrtroblotted on an Immobilon membrane (Hillipore 
Corp*, Bedford. Massachusetts) as described by 
P. Matsudaira, J. Biol . Chem. , 262, pp. 10035-38 
(1987). 

Amino acid analysis of protein samples was 
30 performed by hydrolysis of protein in 6 N HC1, in 
vacuo, for 24 h at 110 °C. The hydrolysates were 
then applied to a Beckman 6300 Analyzer equipped 
with post-column detection by ninhydrin. Western 
blot analysis of the SDS-PAGE gels was carried out 
35 by standard techniques using rabbit antisera JB-1. 
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Sequence analysis re ealed an amino terminal sequence 
of: Met-Gln-Gly-Asn-Lys-Val-Val . . - 

The purified rsT4. 113.1 protein was found 
to contain stoichiometric quantities of the amino- 
5 terminal methionine placed in the protein construct 
for expression in E.coli and an intact polypeptide 
chain consistent with a sequence derived from the 
plasmid construction. Recovery of phenyl thiohydan- 
toinyl -methionine at the first cycle of the degrada- 

10 tive chemistry was 60% consistent with routine initial 
yields obtained in the automated Edman. This obser- 
vation excludes the possibiity that a significant 
percentage of the rsT4. 113.1 lacked the initiation 
methionine, i.e., the NH^ -methionine was not removed 

15 by expression of rsT4. 113.1 in E.coli , or that sequence 
analysis was impaired by the presence of glutamine 
at the first cycle of the degradative chemistry. 
Sequence analysis was performed for 40 cycles and no 
evidence of lysine carbamylation was observed. Amino 

20 acid analysis displayed a close correlation of actual 

and theoretical values for amino acids, thus indicating 
the marked absence of proteolytic degradation in the 
course of expression, or purification, or both. 

Immunoprecipitation Of CHO Cell 
25 Lines Producing Soluble T4 

35 

We tested the conditioned media from S-Cys 
metabolically labelled CHO cells trans fected with 
one of the T4 mutant constructs pBG377, pBG380, pBG381, 
the full length recombinant T4 construct pBG379, of 

30 this invention or vector only, to determine whether 
* any produced a molecule recognized by the anti-T4 
monoclonal antibody 19 Thy. To carry out this test, 
we incubated about 10 7 CHO cells trans fected with 
either pBG380, pBG381, pBG377, pBG379 or pBG312, for 

35 5 hours at 37°C with 180 pCi/ml 35 S-labelled cysteine 
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[DuPont, New England Nuclear] in 4 ml RPMI cys" medium 
(Gibe ). After labelling of the cells, 1 ml of fil- 
tered, conditioned media was made 0.5 mM with phenyl - 
methyl -sulphonyl fluoride and immunoprecipitated 
5 with OKT4 and protein A Sepharose [P. H. Sayre and 
E. L. Re inhere, Eur. J. Immunol. , 15, pp. 291-9S 
(1985)]. Subsequently , we incubated media from the 
35 S-labelled cells with OKT4 (ATCC #CRL 8002). We 
then immuno-precipitated with protein A Sepharose 

10 and subjected the i mmuno-precipi tates to SDS-PAGE 

under reducing conditions on 10% polyacrylamide gels 
[U. K. Laemmli, Nature , 227, pp. 680-85 (1980)]. 
Autoradiography was carried out with X-Omat X-ray 
film (Eastman Kodak) . 

15 As shown in lanes 3-5 of Figure 32, both 

pBG380 (rsT4.2) and pBG381 (rsT4.3) directed the 
synthesis of a secreted, immune, 35 S-labelled T4 
protein that was recognized by the OKT4 anti-T4 
antibody. The immunoprecipitated truncated mole- 

20 cules migrated as 49 Kd proteins, a result consis- 
tent with their predicted molecular weights. In 
contrast, no soluble T4 antigen could be detected in 
the conditioned media of cell lines stably trans - 
fected with pBG377 (rsT4.1) or pBG379 (rflT4). 

25 Immunoprecipitation analysis of cellular extracts of 
cell lines trans fected with pBG377 suggests that the 
rsT4.1 gene may be mis folded, which could account 
for a block: in its secretion [M. J. Gething et al.. 
Cell , 46, pp. 939-50 (1986)]. 

30 In Figure 32, the lanes represent the 

following: Lane 1 : immunoprecipitation from condi- 
tioned medium of CHO cells stably co- trans fected 
with vectors pBG312 and pAdD26. Lane 2 : blank. 
Lanes 3 and 4 : immunoprecipitation from conditioned 

35 medium of CHO cells stably co-trans fected with pBG380 
(rsT4.2) and pAdD26. Lanes 5 and 6 : immunoprecipita- 
37 17 ^ on ^ roxn conditioned medium of CHO cells stably 
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co- trans fected with pBG381 (rsT4.3) and pAdD26 . 
Lane 7 : immunoprecipitation from c nditioned medium 
of CHO cells stably co-transfected with recombinant 
full length T4 (pBG379) and pAdD26. In Figure 32, 
5 the arrow indicates the predicted position of the 
soluble T4 from pBG380 or pBG381 relative to the 
migration of standard molecular weight markers. 

I mmunopr ecip i ta ti on Of COS 7 Cell Lines 
Producing Recombinant Soluble T4 

10 We expressed recombinant soluble T4 

derivatives pBG392, pBG393 and pBG394 in COS 7 cells 
by electroporation, essentially as described by 
G. Chu et al. , "Electroporation For The Efficient 
Trans f ection Of Mammalian Cells with DNA" , Hue. 

15 Acids Res. , 15, pp. 1311-26 (1987). More specifi- 
cally, we introduced 20 closed circular plasmid 
DNA and 380 |ig of carrier (sonicated salmon sperm 
DKA) into 3 x 10 7 COS 7 cells. The cells were 
electroporated using a Gene Pulser (Biorad) set at 

20 300 volts. Subsequently, we incubated the COS 7 

cells in Dulbecco's Modified Eagle's Medium supple- 
mented with 10% fetal calf serum for 24 hours. We 
then harvested the conditioned media, filtered it 
through a Millipore Minidisk 0.22 p hydrophilic 

25 filter cartridge (Millipore #MCGL 305-01) and 

concentrated the secreted proteins on a fast-S ion 
exchange column (S-Sepharose Fast Flow, Pharmacia 
#17-0511-01) in 20 mM MES buffer (pH 5.5). 

We then eluted the bound proteins with 

30 20 mM Tris-HCl (pH 7.7) and 0.3 M NaCl. The elution 
pool was subsequently diluted with 2 volumes of 20 mM 
Tris-HCl (pH 7.7) and it was then loaded on a column 
comprising either 19Thy anti-T4 monoclonal antibody 
and protein A Sepharose or OKT4A and protein A 

35 Sepharose. We washed the column extensively and 

eluted the bound material as 0.5 ml fractions with 
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50 mM glycine-BCl (pH 2.5), 150 mM NaCl, 0.1 DM EGTA 
and 5 vg/ml Bovine pancreatic trypsin inhibitor, 
Aprotinin (Sigma, #A1153 ) . The immunoprecipitates 
were subjected to SDS PAGE (10% gel) followed by 
5 immunoblotting against rabbit antisera raised 

against peptide JB-1. We employed silver stained 
gels to follow binding and elution of rsT4 during 
chromatography . 

Figure 33 depicts an immunoblot analysis of 

10 transiently expressed pBG392 (rsT4.7) [lanes 10, 
11]; pBG393 (rsT4.8) [lanes 4, 7, 8] and pBG394 
(rsT4.9) (lane 5]. The standards are 50 ng purified 
rsT4.3 (lane 1); 150 ng purified rsT4.3 (lane 2) and 
250 ng purified rsT4 . 3 (lane 3). The arrow indicates 

15 the expected position of migration of a protein with 
the relative molecular weight of rsT4 . 7 : 21,000 
daltons. The sample that was to be loaded into lane 4 
was lost and lanes 6 and 9 are blank. 

As shown in lanes 10 and 11 of Figure 35, 

20 pBG392 (rsT4.7) directed the synthesis of a secreted, 
immune protein that was recognized by the anti-T4 
antibodies OKT4A and 19Thy. Lanes 4, 7 and 8 also 
demonstrate that pBG393 (rsT4.8) directed the 
synthesis of a secreted, immune protein that was 

25 recognized by OKT4A and 19Thy. This analysis 

illustrates that rsT4.7 contains the OKT4A epitope. 
It also suggests that the binding region for HIV 
envelope binding resides in the amino 182 terminal 
residues of T4. 

30 " ~ " in contrast, no soluble -T4- could- be detected-- 

- in the media of cell lines trans fected with pBC394 
"(rsT4.9) [see lane 5]. Immunoprecipitation analysis 
of cellular extracts of cell lines transfected with 
pBG397, however, showed that rsT4.9 was recognized 
35 by OKT4A. We believe that rsT4.9, a 113 amino acid 

construct, binds the HIV virus and that it represents 
a second generation soluble T4, one with only two 

3719 



WO 89/01940 



PCT/US88/02940 



-70- 

cysteines and one of three disulfide bridges. 
Accordingly, rsT4.9 is easily produced in E.coli or 
yeast systems. 

Similarly, although no soluble T4 could be 
5 detected in the media of cell lines trans fected with 
pBG396 <rsT4.12), analysis of cellular extracts of 
those cell lines showed that rsT4.12 was recognized 
by 0KT4A. Thus, rsT4.12 may also bind HIV virus. 

Radioimmunoassay And Epitope 
10 Analysis Of rsT4.113 

In order to determine if the 113 fragment 
of rsT4 contained structural determinants for binding 
to OKT4A, Leu-3A and OKT4, we then carried out 
radioimmunoassay and epitope analysis of rsT4.113 

15 using a competitive inhibition radioimmunoassay 

[C. J. Newby et al. , "Solid-Phase Radio immune Assays" 
in Handbook Of Experimental Immunology, D. M. Weir 
(Ed.). 1# PP- 34.1-34.8 (1986)]. As OKT4A and Leu-3A 
block infectivity of HIV in vitro f Dalgleish et al. , 

20 supra] and binding of T4 to gpl20/160 [ McPouqal et al. , 
supra] , this analysis served as a first approximation 
as to whether or not rsT4 . 113 contained structural 
elements for interaction with HIV. 

We first coated U-bottom 96 well microti ter 

25 plates (Falcon) with 50 p I/well goat-anti-mouse IgG 
(Hyclone Typing Kit. Logan, Utah) in PBS (pH 7.0) to 
a concentration of 50 %ig/ml and incubated the plates 
overnight at 4 °C- We then_ rinsed _the plates _ with 
IX PBS and blotted them dry. The plates were then 

30 - blocked by the addition of 100 pl/well of a IX PBS 
solution containing 5% bovine serum albumin for 
1 hour at room temperature. We rinsed the plates 
with PBS, blotted dry and then spotted them with 
50 pi of one of three antibody solutions containing 

35 either OKT4 (10 pg/ml in block buffer); OKT4A 
(500 ng/ml in block buffer) or Leu-3A (Becton- 
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Dickinson) (500 ng/ml in block buffer). We let the 
plates stand for 2 hours at r om temperature. We 
then washed the plates 3 times with a PBS/0.05% 
Tween-80 solution and 2 times with IX PBS and blotted 
them dry. 

In a separate plate , we titrated competitor 
samples of unlabeled rsT4. 313.1 from 20 pg/ml and 
serially diluted twice (including no competitor con- 
trol), with final volumes in each well of 25 pi. 
The positive control for this assay was competition 

with unlabeled rsT4.3 (375 amino acids). We then 
125 

added 25 pi of * I-rsT4.3 containing 10 , 000 
cpm/25 pi (prepared according to A. E. Bolton and 
w. M. Hunter, Radioimmunoassay And Related Methods , 
Chapter 2c). Subsequently, we spotted the entire 
50 pi content of each well onto the assay plate con- 
taining each of the three antibody solutions and 
incubated for 2 h at room temperature. We 
washed the plates 3 times with a PBS/0. 5% Tween-80 
solution and 2 times with IX PBS, blotted them dry 
and then counted the wells in a Beckman gamma counter 
for radioactivity. 

As shown in Figure 34, rsT4. 113.1 competes 

125 

with I-rsT4.3 for absorption to an OKT4A solid 
phase in a dose-dependent manner. Additionally, 

IOC 

rsT4. 113.1 competes with I-rsT4.3 for absorption 
to a Leu-3A solid phase in a dose-dependent manner. 
By comparison to unlabeled rsT4.3, rsT4. 113.1 exhibits 
a molar affinity for those antibodies within a factor 
of 3. In the 0.4 to 25 pg/ml concentration range 
tested, rsT4.113 did not compete with radiolabeled 
rsT4.3 for binding to OKT4. In a similar assay, we 
observed that rsT4.111 also competes with 125 I-rsT4.3 
for binding to OKT4A and Leu-3A, but not to OKT4 
[Figures 35-37] . 

Based on these results, we believe that 
the epitopes for OKT4A and Leu-3A are contained within 
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the amino- terminal ll - * amino acids of T4. We also 
believe that the epitope for OKT4 binding is localized 
within the carboxy terminal of the T4 polypeptide. 

Accordingly, we believe that the gpl20- 
5 binding domain is localized within the amino terminal 
113 or 111 amino acids of the T4 protein. Based on 
this belief, we synthesized various synthetic oligo- 
peptides which contain sequence within that structural 
domain. These oligopeptides are represented in 
10 Figure 3 as follows: 

Oligopeptide Amino Acid Coordinates 

JB-1 44-63 
rsT4 #6 18-29 
rsT4 #7 5-56 
15 rsT4 #8 84-97 

rsT4 #9 30-63 
We synthesized these peptides using conventional 
phosphoamlde DHA synthesis techniques [ Tetrahedron 
Letters , 22, pp. 1859-62 (1981)]. We synthesized 
20 the peptides on an Applied Biosys terns 380A DHA 

Synthesizer and* purified them by gel electrophoresis. 

ELISA Assay For rsT4.113 

We also carried out an ELISA assay for 
rsT4. 113.1 produced by p211-llr trans formed E.coli . 

25 Throughout this assay # dilutions were made in block- 
ing solution and, between each step, we washed the 
plates with PBS/0.05% Tweeh-20. More specifically, 
we coated wells of Immulon 2 (Dynatech, Chantilly, 
Virginia )— plates-with- - 005- GD (-280- nm ) /ml— o £ OKT4 

30 (IgG2b) in 0.05 M bicarbonate buffer to a volume of 
"50 pl/well and incubated the plates overnight at 
4°C. We then blocked the plates with 5% bovine 
serum albumin in PBS , 200 pl/well, and incubated for 
30 minutes at room temperature. 

35 subsequently, we added 50 pi of 50 ng/ml 

rsT4.3 to each well, incubating overnight at 4°C. 
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We then added 50 pl/well of a mixture containing 
rsT4. 113.1 and 10 ng/ml of OKT4A and incubated for 
2 1/2 hours at room temperature. Using a Hyclone 
Kit (Hyclone), we then carried out the following 
5 steps. First, we added 1 drop of rabbit anti -mouse 
IgG2a to each well and incubated the plates for 
1 hour at room temperature. We then added 100 pi of 
peroxidase-labeled anti-rabbit IgG, diluted 1:4000 
with blocking buffer to each well, and incubated for 
10 1 hour at room temperature. 

We prepared a substrate reagent as follows. 
We diluted substrate reagent 1:10 in distilled water 
and added two O-pheny 1 - e thy 1 ene- di amine ("OPD") 
chromophore tablets per 10 ml of substrate. We let 
15 the mixture dissolve thoroughly by mixing with a 

vortex. Alternatively, a TMB peroxidase substrate 
system (Kirfcegaard & Perry Catalogue 950-76-00) may 
be used. Subsequently, we added 100 pi of the 
chromophore solution to each well, incubated for 
20 10-15 minutes at room temperature and then stopped 
the color development with 100 pi of IN B^SO^. We 
then measured OD at 490 nm, using an ELISA plate 
reader. 

The results of the assay are demonstrated 
25 in Figure 38. 

We then subjected the soluble T4 proteins 
produced by the T4 constructs of this invention to 
various functional assays . 



30 



.. Assays Of The Antiviral Activity Of Soluble T4_ 

The antiviral activity of soluble T4 
according to this invention was evaluated using 
modifications of various in vitro systems used to 
study antiviral agents and neutralizing antibodies 
[D. D. Ho et al., "Recombinant Human Interferon Alpha 
35 (A) Suppresses HTLV-III Replication In Vitro ", 
3 7 23 Lancet , pp. 602-04 (1985); K. Hartshorn et al,. 
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"Synergistic Inhibition Of HTLV-ill Replication 
In Vitr By Phosphonof nnate And Recombinant Inter- 
feron Alpha-A", Antimicrob Aq Chemoth , 30, pp. 189-91 
(1986) J. 

5 For each of these assays, we prepared graded 

concentrations of soluble T4 and preincubated them 
with an H9 derived 1 1 IB isolate of HIV [a gift from 
Drsl M. Popovic and R. Gallo, National Cancer 
Institute, Bethesda, Maryland]. The isolate was 

10 maintained as a chronically infected culture in H9 
cells. Cell-free HIV stocks were obtained from 
supernatant fluids of HTLV-III infected H9 cultures 
(culture conditions: 1 x 10 6 cells/ml with 75% viable 
cells ) . We prepared serial 10 fold dilutions of 

15 recombinant soluble T4 ranging from 10 picograms/ml 
to 10 mi cr ogr ams/ml and incubated *»Hj»m with fifty 
50% tissue culture infectious doses (TCID 5() ) of HIV 
fcr 1 hour at 37°c, in RPMI-1640 supplemented with 
20% heat inactivated fetal calf serum (FCS). We 

20 then added 150 pi of H9 cells to a final concentra- 
tion of 0.5 x 10 6 cells/ml which were not HIV-infected 
to the wells containing aliquots of the recombinant 
soluble T4/HIV mixture. 

We adjusted each virus inoculum to a con- 

25 centra tion of 250 TCID 50 /ml- We preincubated 100 pi 
of the virus inoculum with 200 pi recombinant solu- 
ble T4 or 100 pi immunoglobulin prepared in tripli- 
cate serial 2-fold dilutions for 1 hour at 37 °c 
prior to inoculation onto 1.5 - 2 x 10 6 H9 cells in 

30 5 ml RPHI 1640 supplemented fetal calf serum (20%), 
HEPES (lOmM), penicillin (250 U/ml ) , streptomycin 
(250 pg/ml) and L-glutamine (2mM). On days 5, 6, 7, 
10 and 14, we examined each culture for characteris- 
ic cytopathic effects ("CPE"). Neutralization was 

35 defined as the inhibition of syncytia formation com- 
ared with controls. 
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Thc pbsitivfc control us*»d was HIV ser posi- 
tive neutralizing serum, as described in D. D. Ho 
et al . , "Human Immunodeficiency virus Neutralizing 
Antibodies Recognize Several Conserved Domains On 
5 The Envelope Glycoproteins" , J . Virol . , 61, 

pp. 2024-28 (1987). The negative controls used 
were HIV seronegative serum only and buffer only. 

Cytopat>"^ Effect Assay (CPE) 

In this assay, following conventional 
10 protocols for cytopathic effect assays f Klatzmann 

et al. (1984), supra and Wong-Staal and Gallo (1985), 
supra], we microscopically examined the H9 cells for 
evidence of cytopathic effects of HIV. 

The CPE was scored on a four point scale 
15 from 1+ to 4*, with 4+ representing the highest 
degree of CPE. 

By day 14, wells containing recombinant 
soluble T4 according to this invention (rsT4.2, 
derived from the pBG380 trans fee ted CHO cell line 
20 BG380) at 10 vg/ml showed no evidence of CPE, while 
the negative control showed 1+ to 3+ CPE. 

p24 Radioimmunoassay 

We then tested soluble X4 as an inhibitor 
of viral replication in an HIV virus replication 

25 assay- according to D. D. Ho et al., J. Virol. , 61, 
pp. 2024-28 (1987) and J. Sodroski et al.. Nature , 
322, pp. 470-74 (1986). We carried out the assay 
essentially as described, except that the cultures 
were propagated in microtiter wells containing 

30 200 pi- In this assay, we evaluated the ability of 
the soluble T4 polypeptides of this invention to 
block HIV replication, as measured by HIV p24 
antigen production. We sampled supernatants twice 
weekly for HIV p24 antigen as described below. 
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We btained an assay kin [HTLV-III p24 
Radioimmun assay System, Catalogue Nc NEK-040, 
NEK- 040 A, Biotechnology Systems, New Research 
Products, Dupont] which contains affinity purified 
125 I labelled HIV p24 antigen, a rabbit anti-p24 
antibody and a second goat anti-rabbit antibody which 
is used to precipitate antigen-antibody complexes. 
We carried out the assay according to the protocol 
included with the kit. Accordingly, we mixed a sample 
to be assayed or one of a series of amounts of 
unlabelled p24 antigen with a fixed amount of 125 1 
labelled p24 and a fixed limited amount of rabbit 
anti-p24 antibody. We incubated the samples over- 
night at room temperature and then added a goat 
anti-rabbit immunoglobulin preparation for 5 minutes 
at 40°C. We centrifuged the samples in a microfuge 
and aspirated the supernatant fluid. Felletted 12 5 I 
labelled p24 was quantitated for each sample by gamma 
counting and a standard curve for the 125 I p24 dis- 
placed by the known amounts of antigen added to 

standard tubes was constructed. We then calculated 
125 

the I labelled p24 displaced by the antigen present 
in the unknown samples by interpolation using the 
standard curve constructed from the known amounts of 
p24 antigen contained in the standard samples. The 
results are shown in the table below. 
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p24 ASSAY OF H~V REPLICATION INHIBITION 



10 



Day 



10 



14 



rsT4.2 


Patient 
Serum 


Average 
CPM 


% Bound/ 
Unbound 


5.0** 


Negative 
Positive 


344 

2,237 

551 
j j ± 

1,766 


8.5 
112,4 

1 Q Q 
86.6 


0.5* 
5.0** 


Negative 
Positive 


230 
2,459 

322 
1,980 


2.2 
124.6 
7.3 
96.3 


0.5* 
5.0** 


Negative 
Positive 


221 
2,284 
246 
1,988 


1.8 
115.0 
3.1 
98.7 



15 

These results demonstrate that soluble T4 
according to this invention at a concentration of 
5 tig/ml completely i nhib its virus replication as 
measured in this standard 14 day assay. These 
20 results are also depicted in Figure 39 in graphic 
form. In Figure 39, values were calculated from a 
standard curve of p24 according to assay kit 
instructions . 



25 * This concentration was initially believed to be 
1.0 pg/ml, based upon our preliminary approximation 
that 1 unit of absorbance at 280 nm ("^on")* was 
equivalent to 1 mg of rsT4.2. Absorbance at 280 nm 
is a commonly used first approximation of protein 
30 concentration. Upon amino acid analysis of the pro- 
tein, however, we found that it had a higher extinc- 
tion coefficient than originally approximated, with 
unit of rsT4.2 being equivalent to 0.5 mg of 
Jotein.. „_ . 



35 ** This concentration was initially believed to 

-be 10 pg/ml, based upon our preliminary approximation 
that 1 unit of absorbance at 280 nm ("A^qo")* wa s 
equivalent to 1 mg of rsT4.2. Absorbance at 280 nm 
is a commonly used first approximation of protein 

40 concentration. Upon amino acid analysis of the pro- 
tein, however, we found that it had a higher extinc- 
tion coefficient than originally approximated, with 
1 A 2fl0 unit of rsT4.2 being equivalent to 0.5 mg of 
the protein. 
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We then carried out: a pi4 replication assay 
as described ab ve, except that the soluble T4 was 
added to the infected cultures during re feeding at 
days 3, 7 and 10, in order to maintain a constant 
5 rsT4 concentration throughout the infection period. 
The results of this assay are shown in the table 
below. 

INHIBITION OF HIV REPLICATION 
WITH CONSTANT CONCENTRATION OF rsT4 



10 rsT4.2 p24 

(pq/ml) (n<y/ml) 

0.008 770 

0.031 970 

0.125 8S 

15 0.5 0 

5.0 0 

0 1120 

uninfected 0 



These results demonstrate that when solu- 
20 ble T4 protein according to this invention was main- 
tained at a constant concentration throughout the 
infection period, as little as 0.125 pg/ml of the 
protein substantially blocked replication of 250 
TCID 50 /ml of HIV-1. 
25 Advantageous ly, soluble T4 protein accord- 
ing to this invention, at concentrations far exceed- 
ing those required to block viral replication, did 
not exert immuno toxic effects in vitro , as measured 
by three lymphocyte proliferation assays — mixed 
30 lymphocyte response, phytohemagglutinin, and tetanus 
toxoid st i mu la ted response. - ~ . . _ _ 

Syncytia Inhibition Assay 

To further assess the effect of soluble T4 
on HIV env-T4 binding, we evaluated the effect of two 
35 preparations of our soluble T4 protein on the syn- 
cytiagenic properties of HIV in the co-cultivation 
assay* We carried out a C8166 cell fusion assay 
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as described in B. D. Walker et p 1 .., Proc . Natl . 
Acad. Sci. USA , 84, pp. 8120-24 <198<). 

We incubated 1 x 10 9 H9 cells chronically 
infected with HTLV-IIIB for 1 hour at 37°C in 5% 
5 C0 2 with various concentrations of one of two 

preparations of rsT4.2 in 150 pi RPMI-1640 media 
supplemented with 20% fetal calf serum. We then 
added 3 x 10 4 C8166 cells in 50 ul media (a T4 + 
trans formed human umbilical cord blood lymphocyte 

10 line f Sodroski et al . . supra] # to a final volume of 
0,2 ml in each well. Final well concentrations of 
soluble T4 were 0.5 ug/ml* and 5.0 ug/ml* for prepa- 
ration #1 and 1.25 ug/ml* and 12.5 ug/ml* for prepa- 
ration #2. We then counted total number of syncytia 

15 per well at 2 hours and 4 hours after adding the 

C8166 cells at 37°C in 5% C0 2 - Parallel co-cultiva- 
tions used buffer alone (negative control) or OKT4A 
at 25 ug/ml (positive control) as controls. We con- 
sidered a positive result as a 50% reduction in 

20 syncytia compared to controls, at a time when at 
least 100 syncytia per 10 4 infected H9 cells were 
present in the control cultivations. The results of 
this assay are shown below and in Figure 40 (2 hour 
data). 



* These concentrations were initially believed 
to be, respectively, 1 ug/ml, io ug/ml. 2.5 ug/ml 
and 25 ug/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 run ( "A~ fl " ) , 
30 was equivalent to 1 mg of rsT4.2. Upon amino acid 
analysis of the protein, however, we found that it 
had a higher extinction coefficient than originally 
approximated, with 1 A MQ unit of rsT4.2 being 
equivalent to 0.5 mg of the protein. 
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INHIBITION IN C8166 FUS "ON ASSAY 



Preparation 

buffer 
rsT4.2 
rsT4.2 
rsT4.2 
rsT4.2 

OKT4A (25 pg/ml) 



2] 


(uv/ml) 


2 Hrs 


4 Hrs 


0 




0 


0 


0. 


5** 


30 


42 


5, 


0** 


54 


47 


1. 


25** 


16 


21 


12. 


5** 


77 


55 


0 




100 


100 



10 As demonstrated in this table and in Fig- 

ure 40, soluble T4 according to this invention at 
5.0 pg/ml and 12.5 pg/ml inhibited syncytia formation 
at 2 hours, as compared to buffer alone. By 4 hours 
after the addition of C8166 cells, soluble T4 at 

15 12.5 pg/ml continued to inhibit greater than 50% 
syncytia formation, as compared to the negative 
control . 

We also evaluated the effect of two prep- 
arations of our soluble T4 protein rsT4.7 on the 
20 syncytiagenic properties of HIV in a similar co- 
cultivation assay. The results of this assay are 
shown below. 



* All assays w ere carried out in triplicate, and 
25 the number of syncytia counted per well was averaged 
to calculate % inhib ition. The % inhibition repre- 
sents the difference between the average number of 
syncytia in the negative control (without rsT4 or 
OKT4A) and the average number of syncytia counted 

30 -- when either rsT4 or OKT4A were -present during the 

assay, divided by the average syncytia count for 
the negative control and multiplied by 100. 

** These concentrations were initially believed 
to be, respectively, 1 pg/ml, 10 pg/ml, 2.5 pg/ml 

35 and 25 pg/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 nm ( ,,A 2ao M, ' 
was equivalent to 1 mg of rsT4.2. Upon amino acid 
analysis of the protein however, we found that it 
had a higher extinction coefficient than originally 

40 approximated, with 1 xm ^ t - of rsT4.2 being 

equivalent to 0.5 mg ox^tlhe protein. 
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INHIBITION IN C8166 FUSION ASSAY 
Average 

rsT4.7 Syncytia/SOpl % Inhibition 
5 Preparation ( v g/ml ) alicruot at, 2 Hrs 

H9 cells 0 0 N/A 

( control ) 

C8166 cells 0 0 N/A 

( control ) 

10 HIV-infected 0 118 0 

H9 cells 
' added to 
C8166 cells 
( control ) 

15 OKT4A 0 0 100 

( control ) 

Prep. 1 of 

rsT4.7 s 5.0* 43 63.6 



20 * This concentration */as initially believed to 
be 10 |jg/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 nm ( "A 2Q " ), 
was equivalent to 1 mg of rsT4.2. Upon amino acid 
analysis of the protein, however, we found that it 

25 had a higher extinction coefficient than originally 
approximated, with 1 A- Q0 unit of rsT4.2 being 
equivalent to 0-5 mg or the protein. 
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10 



15 



Assay da^e - dav 13 



Prep ar a t i on 

H9 cells 
( control ) 

C8166 cells 
( control ) 

HIV- infected 
H9 cells added 
to C8166 cells 
( control ) 

OKT4A ( control ) 

Prep. 2 of 
rsT4.7 



rsT4 . 7 
(pg/ml) 



S 5.0* 



Average 
Syncytia/50M 1 
aliquot 



141 



% Inhibition 
at 2 Hrs 

N/A 



N/A 



27 



100 



80.9 



* This concentration was initially believed to 
be 10 pg/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 nm ( "A ) , 
20 was equivalent to 1 mg of rsT4.2. Upon aminS 8 Scid 
analysis of the protein, however, we found that it 
had a higher extinction coefficient than originally 
approximated, with 1 A„ Q unit of rsT4.2 being 
equivalent to 0.5 mg ox^the protein. 
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Preparation 

5 H9 cells 
( control ) 

C8166 cells 
( control ) 

HIV-infected 
10 H9 cells added 
C8166 cells 
( control ) 

OKT4A ( control ) 

Prep* 1 of 
15 rsT4.7 

Prep. 2 of 
rsT4 . 7 



rsT4 . 7 
(ug/ml) 



S 5.0* 



= 5.0* 



Average 

Syncytia/SOp 1 
alicruot 



128 



35 



% Inhibition 
at 2 Hrs 

N/A 



N/A 



100 

72.7 

98.4 



20 



25 



30 



35 



As demonstrated in these tables, soluble 
T4 protein rsT4.7 inhibited syncytia formation in 
HIV-infected H9 cells. 

We also evaluated the effect of rsT4. 113.1 
and rsT4.111 on the syncytiagenic properties of HIV in 
a co-cultivation assay. We carried out a C8166 cell 
fusion assay as described in Walker et al. , supra. 

We incubated 1 x 10 4 H9 cells chronically 
infected with HTLV-IIIB for 1 hour at 37°C in 5% 
C0 2 , with from 5 to 50 pg/ml rsT4. 113.1 or rsT4.111 
in 150 pi RPHI-1640 media supplemented with 20% 
fetal calf serum in 96-well micro titer plates. We 



*- This concentration was initially believed to 
be 10 Mg/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 run ("A- " ) , 
was equivalent to 1 mg of rsT4.2. Upon amin6 8 2cid 
analysis of the protein, however, we found that it 
had a higher extinction coefficient than originally 
approximated, with 1 A-- 0 unit of rsT4.2 being 
equivalent to 0.5 mg or^the protein. 
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then added 3 x 10 4 C8166 cells to the wells in 50 pi 
aliguots. The plates were incubated for 2 hours at 
37°C in 5% C0 2 and, following this incubation, the 
number of syncytia per well were counted. 
5 Syncytia were defined as cells containing 

a ballooning cytoplasm greater than three cell 
diameters. All samples were counted twice. Parallel 
co-cultivation used OKX4A alone or rsT4.3 alone at a 
concentration of 25 pg/ml (positive controls) or H9 
10 cells alone or C8166 cells alone (negative controls). 
The results of this assay are shown below and in 
Figure 41. 

INHIBITION IN C8166 FUSION ASSAY 



15 



2Q 



Preparation 


rsT4(ua/ml ) 


X Inhibition 


H9 cells (control) 


0 


0 


C8166 cells (control) 


0 


0 


rsT4. 113.1 


1.25 


35 


rsT4. 113.1 


2.5 


63 


rsT4. 113.1 


4.25 


63 


rsT4. 113.1 


6.25 


82 


rsT4. 113.1 


12.5 


96 


rsT4.3 


12.5 


100 


OKT4A (25 Mg/ml) 


0 


100 



As demonstrated in this table and in 
25 Figure 41, rsT4- 113.1 exhibited a dose-dependent 

inhibition of HIV-induced syncytia formation. The 
molar specific inhibitory activity of rsT4. 113.1 
appeared to be reduced by an order of magnitude by 
comparison to anti-viral activity of longer forms of 
30 recombinant soluble T4. Thus, whereas rsT4. 113.1 is 
effective toward neutralization of HIV-dependent 
cell fusion in vitro , its molar specific inhibitory 
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activity is decreased by a factor of 10. It is 
undetermined whether this decreased potency is due 
to incompl te renaturation of the E . coli -derived 
protein, the presence of three additional amino 
acids at the N- terminus of rsT4. 113.1 (Met-Gln-Gly ) 
lacking in rsT4-2 or rsT4 3 produced in mammalian 
cells, or the absence of additional structure in 
rsT4. 113.1 required for high-affinity binding to 
HIV. 

We also carried out a C8166 cell fusion 
assay with rsT4-lll, as described for rsT4. 113.1- 
The results of this assay are shown below. 

INHIBITION IK C8166 FUSION ASSAY 

X Inhibition 



0 
0 
0 
40 
20 
67 
100 
100 
100 
100 
100 



Preparation rsT4(ua/ml) 

15 H9 cell (control) 0 

C8166 cells (control) 0 

rsT4.111 1-25 

rsT4.111 2.5 

rsT4.111 4.25 

20 rsT4.111 6.25 

rsT4.111 12-5 

rsT4.111 25.0 

rsT4.3 12.5 

rsT4.3 25.0 

25 OKT4A (25 Mg/al) 0 



As demonstrated in this table, rsT4.111 
exhibited a dose-dependent inhibition of HIV-induced 
syncytia formation. At a concentration of 12-5 pg/ml 
and 25.0 pg/ml, complete inhibition of cell fusion 
30 was achieved. 

Kinetics Of Intramuscular injection Of Soluble T4 

We examined the kinetics of the appearance 
of a recombinant soluble T4 protein according to 
this invention (specifically, rsT4.3 from the pBG381- 
35 transfected cell line BG381) in serum after intra- 
muscular injection as follows. 
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We obtained two cynomoigus monkeys ( Macaca 
fascicularis ) who were free of infectxous disease 
and in good health. Each monkey had been subjected 
to a 6 week quarantine period prior to administration 
5 of the soluble T4 protein. Throughout the adminis- 
tration period, each monkey was maintained on a con- 
ventional diet of monkey chow supplemented with fresh 
fruit. A catheter and a vascular access port were 
surgically placed in a femoral vein of each animal 

10 prior to treatment in order to facilitate blood 
collection. 

Over a period of 28 days, each animal 
received recombinant soluble T4 protein twice daily 
by intramuscular injection to the large muscles of 

15 the thighs or buttocks. Injections were administered 
to each animal 8 hours apart and each injection con- 
tained a volume of 0-15 ml/kg (0.25 mg/kg) of rsT4.3 
(from the pBG381- trans formed cell line BG381), for a 
total dose of 0.5 mg/kg/day/monkey . Serum samples 

20 for clearance determination were collected on day 1 
before the first treatment and at 1, 2, 4 and 
8 hours after the first injection, as well as 1, 2, 
4, 14 and 16 hours after the second injection on 
days 7, 14 and 28. 

25 We found that intramuscularly injected 

soluble T4 reached the maximum level in serum between 

1 and 2 hours after injection, with the level falling 
off slowly and reaching half-maximum value at approxi- 
mately 6 hours post- injection. According to data 

30 obtained for intravenous administration (not shown), 
the level of rsT4.3 in serum should drop below that 
attained via intramuscular injection aproximately 

2 hours after intravenous injection. Thus, while 
the maximum rsT4.3 level in serum after intramuscular 

35 injection does not reach that attainable via intra- 
venous injection, it is slowly released into the 
blood stream, remaining detectable in serum for a 
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ffluch 1 nger time. This slow release mechanism asso- 
ciated with intramuscular routes of injection is 
advantageous because a higher level of soluble T4 
protein is available over a longer period of time 
5 over a given concentration; thus remaining in a sus- 
tained level. Intramuscular administration of solu- 
ble T4 protein is particularly useful in treating 
early stage HIV-infected patients, to prevent the 
virus from disseminating, or in treating patients 

10 who have been exposed to the virus and who are not 
yet seropositive. 

We determined serum levels of rsT4.3 using 
an ELISA assay. Throughout this assay, dilutions 
were made in blocking solution and, between each 

15 step, we washed the plates with PBS/0 . 05% Tween-20- 
More specifically, we coated wells of Immulon 2 
plates with .01 OD (280 nm)/ml of OKT4 (IgG2b) in 
0.05 M bicarbonate buffer to a volume of 50 pl/well 
and incubated the plates overnight at 4°C. We then 

20 blocked the plates with 5% bovine serum albumin in 
PBS, 200 MVwell # and incubated for 30 minutes at 
room temperature . 

Subsequently, we added 50 m1 of sample or 
standard to each well, incubating for 4 hours at 

25 room temperature. We then added 50 pl/well of OKT4A 
at 0.1 vg/ml and incubated overnight at 4°C. Using 
a Hyclone Kit (Hyclone) we then carried out the fol- 
lowing steps. First, we added 1 drop of rabbit 
anti-mouse IgG2a to each well and incubated the . 

30 plates for 1 hour at room temperature. We then added 
.100 m! of peroxidase-labeled anti-rabbit IgG, diluted 
1:4000 with 5% BSA/PBS to each well, and incubated 
for 1 hour at room temperature . 

We prepared a substrate reagent as follows . 

35 We diluted substrate reagent 1:10 in distilled water 
and added two O-phenyl-ethylene-diamine ("OPD" ) 
chromophore tablets per 10 ml of substrate. We let 
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the mixture dissolve thoroughl; by mixing with a 
vortex. Alternatively, a TMB peroxidase substrate 
system (Kirkegaard & Perry Catalogue #50-76-00) may 
be used. Subsequently, we added 100 pi of the 
5 chromophore solution to each well, incu b ated for 
10-15 minutes at room temperature and then stopped 
the color development with 100 pi of IN E^SO^. We 
then measured OD at 490 nm, using an ELISA plate 
reader. 

10 The results of the assay are demonstrated 

in the tables below. 
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Monkey #7-91 

rsT4 Level 
(ng/ml) 





Time ( hr ) 


Dav 1 


Dav 7 


UCL y JL** 


Day 




5 


0 


22.7* 


96.5 


158 .0 


1Q 


Q 
• O 




1 


278,8 


199.6 


360.7 


0 P 


•a 
• J 




2 


281.8 


366.8 


306.4 


441 


.1 




4 


214 . 9 


246 . 6 


363.9 


393 


- 2 




5 








290 


.4 


10 


8 


72 .3 


105 • 0 


199.4 








9** 


246.2 












10 


259.6 












12 


136.0 












22 


23.8 










IS 


24 


13.4 












Monkey #7- 


-92 
















rsT4 Level 














( ncr/ml ) 










Time ( hr ) 


Day 1 


Day 7 


Day 14 


Day 28 




20 


0 


6.7* 


56.0 


106.3 


60.9 






1 


87.2 


225.8 


178.0 


437.7 






2 


254.2 


377.9 


253.2 


770.6 






4 


170.0 


167.3 


308.2 


821.5 






5 








898.3 




25 


8 


118.9 


101.2 


176.5 








9** 


405.1 












10 


523.5 












12 


371.5 












22 


48.4 










30 


24 


39.4 












* 


- background 









- second injection administered after the 
collection of the 8 hour sample. 



Polyvalent Forms Of Recombinant Soluble T4 

35 Receptors may be characterized by their 

affinity for specific ligands, such that, at equili- 
brium, the intrinsic affinity (K ) between monovalent 

a 

receptor and monovalent ligand can be defined as 
[RL]/[R f ] [L f ] , where [RL] is the concentration of 
40 receptor (R) bound to ligand (L) and [R f ] and [L^.] 
are the concentrations of free receptor and ligand. 
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respectively [P. A. Underwood, in Advances In Virus 
Research , ed. K. Maramorosch et al . , 34, pp. 283-309 
(1988)] . 

For a polyvalent receptor (with a valency 
5 of n) binding to a polyvalent ligand (with a valency 
of m), a functional affinity can be defined as 
n l R bl/ n I R f l m t L f 1 * where [R^] is the concentration of 
bound receptor sites, and n[R^] and s[L^] are, respec- 
tively, the concentrations of free receptor and 

10 ligand binding sites. The effect of increasing the 
valence (the number of binding sites) is to enhance 
the stability of ligand-receptor complexes. The 
affinity of a polyvalent receptor for a polyvalent 
ligand will depend on three factors: the intrinsic 

15 association constant of each binding site, the 

valency (number of binding sites) and the topico- 
logical relationship between the receptor and ligand 
binding sites. Under some circumstances, polyvalent 
binding interactions will lead to higher functional 

20 affinity. The decreased dissociation rate of poly- 
valent ligands with polyvalent receptors results in 
an increased functional affinity [C. L. Hornick and 
F- Karush, Immunochemistry, 9, pp. 325-40 (1972); 
I. otterness and F. Karush, "Principles Of Antibody 

25 Reactions 11 , in Antibody As A Tool , ed. J. J. 

Marchalonais and G.W. Warr, pp. 97-137 (1982)]. 

The simplest case for receptor polyvalency 
increasing functional affinity is represented by a 
bivalent soluble receptor , such as an antibody 

30 molecule, which has two identical ligand binding 

sites, each capable of independently binding antigen 
with equal affinity. If the antigen is displayed 
polyvalently, for example, chemically coupled to a 
solid support such that the spacing between antigenic 

35 sites can be bridged by the antibody's two antigen 

binding arms, the functional affinity of the antibody 
for the antigen coupled to the solid support would be 
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greater than the intrinsic affinity cf the antibody 
binding site for the monovalent antigen [D. Crothers 
and H. Metzger, Inununochemistry , 9, pp , 341-57 
(1972)]. Because virus particles represent poly- 
5 valent antigens, the greater functional affinity of 
antibodies for polyvalent antigens is an important 
factor for antibody-directed virus neutralization. 

The association of recombinant soluble T4 
and the HIV major envelope glycoprotein gpl20 is an 

10 example of monovalent receptor binding to monovalent 
ligand. The affinity of this interaction has been 
measured, and the association between T4 and gpl20 
has a dissociation constant K d = 4 x 10 M [L. Lasky 
et al.. Cell , 50, pp. 975-88 (1987)]. 

15 Using the antibody analogy, we believe 

that polyvalent rsT4 will demonstrate a greater 
affinity for HIV-infected cells displaying gp!20 
than monovalent rsT4 and the topicological relation- 
ship between gp!20 on the virus particle or the 

20 infected cell surface, will determine the degree to 
which polyvalent rsT4 exhibits higher functional 
affinity than monovalent rsT4. One example of a 
polyvalent rsT4 is described below, with respect to 
the production of a recombinant bivalent rsT4 con- 

25 sisting of two tandem repeats of amino acids 3-178, 

followed by the C- terminal 199 amino acids of rsT4.3. 
According to this invention, a "polyvalent" receptor 
possesses two or more binding sites for a given 
ligand. Furthermore , the intrinsic affinity of each 

30 ligand binding site of a given polyvalent receptor 
need not be identical. 

As shown in Figure 42, to construct bivalent 
rsT4, we digested pBG391 with Nhe l , which cleaves 
after the valine at position 178 in rsT4, and removed 

35 the Nhe l 5' overhang with mung bean nuclease. Next, 
we cleaved with Bql l I to remove the C-terminal half 
of the rsT4 coding sequence in pBG391. Finally, we 

374 1 



WO 89/01940 



PCT/LS88/02940 



-92- 

ligated a Dra l- Bgl 1 1 fragment contemning the coding 
sequence for rsT4 amino acids 3 (lysine) through 377 
(isoleucine) to the cleaved pBG391 to create pBiv.l, 
a plasmid coding for a fusion protein with a tandem 
5 duplication of the N-terminal 176 amino acids of 

rsT4, followed by the C-terminal 199 amino acids of 
rsT4.3. The protein produced by this plasmid, 
therefore, contains two adjacent N-terminal gp!20- 
binding or OKT4A-binding domains (defined by amino 

10 acid residues 3 through 111 of rsT4.111), followed 
by one OKT4-binding C-terminal domain (Figure 43). 

pBiv.l was trans fected by electroporation 
into COS 7 cells to test expression of the bivalent 
rsT4 protein. Three days later, we tested the con- 

15 ditioned medium of the trans fected cells for the 
presence of the rsT4 bivalent protein by immuno- 
precipitation, followed by Western blot analysis of 
the precipitated protein. Both OKT4A and OKT4 were 
used for immuno-precipitation to determine that the 

20 OKT4 epitope and at least one of the OKT4A epitopes 
had folded correctly. Both antibodies precipitated 
a protein of the predicted apparent molecular weight 
(60,000d) from the conditioned medium of the cells. 

Bivalent rsT4 may be purified by immuno- 

25 affinity purification from an OKT4 column and the 

purified protein may then be used to perform quanti- 
tative .competition assays with rsT4.3. We believe 
that the bivalent molecule would demonstrate equi- 
valent competition against rsT4.3 for OKT4 binding, 

30 but significantly greater competition - against mono- 
valent rsT4 for OKT4A binding. The ability of 
bivalent recombinant soluble T4 to block syncytium 
formation may also be demonstrated in the C8166 
fusion assay- We also believe that bivalent 

35 recombinant soluble T4 would block syncytium 

formation at significantly lower concentrations 
than monovalent rsT4 ; based upon the higher 
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functional affinity of bivalent recombinant 
soluble T4 for gpl20. 

According to alternate embodiments of this 
invention, other methods for producing polyvalent 
5 rsT4 may be employed. For example, polyvalent rsT4 
may be produced by chemically coupling rsT4 to any 
clinically acceptable carrier molecule, a polymer 
selected from the group consisting of Ficoll, poly- 
ethylene glycol or dextran, using conventional 

10 coupling techniques. Alternatively, rsT4 may be 
chemically coupled to bio tin, and the biotin-rsT4 
conjugate then allowed to bind to avidin, resulting 
in tetravalent avidin/biotin/rsT4 molecules. And 
rsT4 may be covalently coupled to dinitrophenol 

15 (DNP) or trinitxophenol (TNP) and the resulting 

conjugate precipitated with anti-DNP or ant i -TNP - 
I gin, to form decameric conjugates with a valency of 
10 for rsT4 binding sites. 

Alternatively, a recombinant chimeric 

20 antibody molecule with rsT4 sequences substituted 
for the variable domains of either or both of the 
immunoglobulin molecule heavy and light chains may 
be produced. Because recombinant soluble T4 
possesses gp!20 binding activity, the construction 

25 of a chimeric antibody having two soluble T4 domains 
and having unmodified constant region domains could 
serve as a mediator of targeted killing of HIV- 
infected cells that express gpl20. 

For example, chimeric rsT4/IgG^ may be 

30 produced from two chimeric genes — an rsT4/human 
kappa light chain chimera ( rsT4 / c ] cappa ) and an 
rsT4/human gamma 1 heavy chain chimera 
(rsT^C^^). Both C kappa and C gaBBa . 1 regions 
have been isolated from human recombinant DNA 

35 libraries, and each has been subcloned into animal 
cell selection vectors containing either the 
bacterial neo resistance or bacterial gpt markers 
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for selection in animal cell hosts against the 

antibiotic G418 or mycophenolic acid, respectively. 

To construct rsT4/C and rsT4/C, 

gamma— 1 ' Kappa 

chimeric genes, an rsT4 gene segment, including at 
5 least the secretory signal sequence and the N-terminal 
110 amino acid residues of the mature rsT4 coding 
sequence and including a splice donor or portion 
thereof, is placed upstream of the gamma- 1 and kappa 
constant domain exons. A suitable restriction 

10 enzyme may be used to cut within the intron down- 
stream of the desired rsT4 coding sequence , thus 
providing a donor splice site. Subsequently, a 
suitable restriction enzyme is used to cut within 
the introns upstream of the kappa and gamma-1 

15 coding regions. The rsT4 sequence is then joined to 
the kappa or gamma-1 constant region sequence, such 
that the rsT4 intron sequence is contiguous with the 
gamma-1 and kappa introns. In this way, an acceptor 
splice site is provided by the kappa or gamma-1 

20 constant region intron. Alternatively, rsT4 chimeric 
genes may be constructed without the use of introns, 
by fusing a suitable rsT4 cDNA gene segment directly 
to the gamma-1 or kappa coding regions. 

The rsT4/C gamna-1 and rsT4/C kappa vectors 

25 may then be co trans fee ted, for example, by electro- 
poration into lymphoid or non- lymphoid host cells. 
Following transcription and translation of the two 
chimeric genes, the gene products may assemble into 
chimeric antibody molecules. 

30 Expression of the chimeric gene products 

may be measured by an enzyme-linked immunoadsorbant 
assay ( ELISA ) that utilizes monoclonal anti-T4 anti- 
body OKT4A, as described infra, or in gpl20 competi- 
tion assays and radioimmunoassays, as described infra. 

35 Activity of the rsT4/IgG^ chimeras may be measured 
by incubating them with HIV-infected cells in the 
presence of human complement, followed by quantitating 
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subsequent complement-mediated lyr\s of these cells. 
Alternatively, activity may be measured in HIV repli- 
cation and HIV syncytium assays as described infra. 

In order to determine if bivalent rsT4 has 
5 a greater potency than monovalent rsT4 f we mixed 
OKT4, at various concentrations, together with a 
constant concentration of rsT4, so that the molar 
ratio of OKT4:rsT4 varied between 0.2 and 4. After 
preincubating the mixture overnight at 4°C, we added 

10 aliguots to the HIV syncytium assay described infra. 

OKT4 has no observable effect in this assay when used 
alone. In addition, the concentration of recombinant 
soluble T4 chosen did not cause inhibition in this 
assay- Accordingly , we looked for indications that 

15 the OKT4/rsT4 mixture was more potent than rsT4 alone 
We observed that at ratios of OKT4 : rsT4 greater than 
0.2, partial to complete inhibition of syncytium 
formation occurred. We believe that under conditions 
where two rsT4 molecules are bound to 1 OKT4 molecule 

20 the greatest inhibitory effect should be found. 

Thus, polyvalent, as well as monovalent 
forms of recombinant soluble T4 are useful in the 
compositions and methods of this invention - 

Hicroorganisms and recombinant DNA mole- 

25 cules prepared by the processes of this invention 

are exemplified by cultures deposited in the In Vitro 
International, Inc. culture collection, in Linthicum, 
Maryland, on September 2, 1987, and identified as: 





BG378 


E. 


coli 


MC1061/pBG378 


30 


199-7: 


: E. 


coli 


MC1061/P199-7 




170-2 


E. 


coli 


JA221/pl70-2 




EC100: 


E. 


coli 


JM83/pEC100 




BG377: 


E. 


coli 


MC1061/pBG377 




BG380: 


E. 


coli 


MC1061/pBG380 


35 


BG381: 


E. 


coli 


MC1061/pBG381 




These 


cultures 


were assigned 



0 _ numbers IVI 10143-10149, respectively. 
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In addition, microorganisms and recombinant 
DNA molecules according to this invention are exempli- 
fied by cultures deposited in the In Vitro Interna- 
tional, Inc. culture collection, in Linthicum, 
5 Maryland, on January 6, 1988, and identified as: 
BG-391: E.coli MC1061/pBG391 
BG-392: E.coli MC1061/pBG392 
BG-393: E.coli MC1061/pBG393 
BG-394: E.coli MC1061/pBG394 
xo BG-396: E.coli MC1061/pBG396 

203-5 : E.coli SG936/p203-5 . 
These cultures were assigned accession 
numbers IVI 10151-10156, respectively. 

Microorganisms and recombinant DNA mole- 
15 exiles according to this invention are also exempli- 
fied by cultures deposited in the In Vitro 
International. Inc. culture collection, in Linthicum, 
Maryland, on August 24, 1988 and identified as: 
211-11: E.coli A89/pBG211-ll 
20 214-10: E.coli A89/pBG214-10 

215-7 : E.coli A89/pBG215-7 

These cultures were assigned accession 
numbers IVI 10183-10185 respectively. 

While we have hereinbefore described a 
25 number of embodiments of this invention, it is 

apparent that our basic constructions can be altered 
to provide othe embodiments which utilize the pro- 
cesses and compositions of this invention. There- 
fore ' Wil1 be a PP reciat ed that the scope of this 
invention is to be defined by the claims appended 
hereto rather than by the specific embodiments which 
have been presented hereinbefore by way of example. 



30 
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CLAIMS 

We claim: 

1. A DNA sequence selected from the group 
consisting of: 

5 (a) the DNA inserts of p!99-7, pBG377, 

pBG380, pBG381, p203-5, pBG391, pBG392, pBG393, pBG394, 
pBG395, pBG396, pBG397, j211-11, p214-10 and p215-7; 

(b) DNA sequences which hybridize 

to one or more of the foregoing DNA inserts and which 
10 code on expression for a soluble T4-like polypeptide; 
and 

(c) DNA sequences which code on 
expression for a soluble T4-like polypeptide coded 
for on expression by any of the foregoing DNA inserts 

15 and sequences. 

2. The DNA sequence according to claim 1, 
wherein said DNA sequence (b) codes on expression 
for a soluble T4-like polypeptide which inhibits 
adhesion between T4 + lymphocytes and infective agents 

20 which target T4 lymphocytes and which inhibits 

interaction between T4* lymphocytes and antigen pre- 
senting cells and targets of T4 + lymphocyte mediated 
killing. 

3. A recombinant DNA molecule comprising 
25 a DNA sequence selected from the group consisting of 

the DNA sequences of claim 1 or 2, said DNA sequence 
being operatively linked to an expression control 
sequence- in said- recombinant ..DNA_molecule 

4. The recombinant DNA molecule according 
30 to claim 3, wherein said expression control sequence 

is selected from the group consisting of the early 
or late promoters of SV40 or adenovirus, the lac 
system, the trp system, the TAC system, the TRC 
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system, the major perator and piomoter regions of 
phage X, the contr 1 regions of fd coat protein, the 
promoter for 3-phosphoglycerate kinase or other 
glycolytic enzymes, the promoters of acid phosphatase, 
5 the polyhedron promoter of the baculovirus system 
and the promoters of the yeast a -mating factors. 

5. A unicellular host transformed with a 
recombinant DMA molecule selected from the group 
consisting of the recombinant DNA molecules of claim 3 
10 or 4. 



6. The host according to claim 5, wherein 
said host is selected from the group consisting of 
strains of E.coli , Pseudomonas , Bacillus , 

Strep tomyces , fungi, animal cells, plant cells, 
15 insect cells and human cells in tissue culture. 

7. A polypeptide coded for on expression 
by a DMA sequence selected from the group consisting 
of the DNA sequences of claim 1 or 2, said polypep- 
tide being essentially free of other proteins of 

20 human origin. 

8. The polypeptide according to claim 7, 
wherein said polypeptide is selected from the group 
consisting of a polypeptide of the formula AA _23~ AA 362 
of Figure 3, a polypeptide of the formula AA^ -362 of 

25 Figure 3, a polypeptide of the formula Me t-AA^^g^ 

of Figure 3, a polypeptide of the formula AA 1-374 of 
Figure 3, a polypeptide of the formula Met-AA 1 _ 374 
of Figure 3, a polypeptide of the formula AA 1-377 of 
Figure 3, a polypeptide of the formula Met-AA 1-377 

30 of Figure 3, a polypeptide of the formula AA _23~ AA 374 
of Figure 3, a polypeptide of the formula AA_ 23 -AA 377 
of Figure 3 . 
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9. The polypeptide accrrding to claim 7, 
wherein said polypeptide is selected from the 
group consisting of a polypeptide of the formula 
AA -23" AA 182 of F i<3ure 16, a polypeptide of the 
5 formula AA 1 -AA 1Q2 of Figure 16, a polypeptide of 

the formula Met "- AA 1 _ 182 of Figure 16, a polypeptide 
of the formula AA _ 2 3~ AA 182 of Fi ^u re 16 * followed 
by the amino acids asparagine-leucine-glutamine- 
histidine-serine-leucine, a polypeptide of the formula 
XQ AA 1 -AA 182 of Figure 16, followed by the amino acids 

asparagine-leucine-glutamine-histidine-serine-leucine, 
a polypeptide of the formula Met - A A 1 . 182 o£ Figure 16, 
followed by the amino acids asparagine-leucine- 
glutamine-histidine-serine-leucine, a polypeptide of 
15 the formula AA^a^^llS of Fi 9ure 16, a polypeptide 
of the formula AA 1 - AA 113 of Figure 16, a polypeptide 
of the formula Met: - A A 1-113 of Figure 16, a polypeptide 
of the formula AA^a'^lll of Fi 9ure 16, a polypeptide 
of the formula AA 1 "AA 111 of Figure 16, a polypeptide 
of the formula Met-AA 1-;L11 of Figure 16, a polypep- 
tide of the formula AA . 2 3" AA 13l of F ^r^ 16, a poly- 
peptide of the formula A A 1 "AA 131 of Figure 16, a 
polypeptide of the formula Me^-AA^^^ of Figure 16, 
a polypeptide of the formula AA . 23 -AA ;L45 of Figure 16, 
a polypeptide of the formula AA 1 "AA 145 of Figure 16, 
r polypeptide of the formula Met: - AA 1 . 145 of Figure 16, 
a polypeptide of the formula AA . 23 "AA 166 of Figure 16, 
a polypeptide of the formula AA 1 ~AA ]L66 of Figure 16, 
a polypeptide of the formula Met " AA 1 . 166 of Figure 16, 
30 or portions thereof • 

10. The polypeptide according to claim 7, 

wherein said polypeptide is selected from the group 

consisting of a polypeptide of the formula AA „-AA„-, 

— 23 3o2 

of mature T4 protein, a polypeptide of the formula 
35 ^1-362 o£ mature T4 protein, a polypeptide of the 
3 7 f t9 nUla Met " AA l-362 of mature T4 protein, a polypep- 
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tide of the formula AA 1-3?4 of m cure T4 protein, a 

polypeptide of the formula Met-AA- of mature T4 

1-374 

protein, a polypeptide of the formula AA 1-3?? of 
mature T4 protein, a polypeptide of the formula 
Met-AA 1 _ 377 of mature T4 protein, a polypeptide of 
the formula AA_ 23 -AA 374 of mature T4 protein, a poly- 
peptide of the formula AA_ 23 ~AA 377 of mature T4 pro- 
tein, or portions thereof, 

11- The polypeptide according to claim 7, 
wherein said polypeptide is selected from the group 
consisting of a polypeptide of the formula AA _ 2 3~ AA 182 
of mature T4 protein, a polypeptide of the formula 
AA 1 -AA 182 of mature T4 protein, a polypeptide of the 
formula Met-AA 1-182 of mature T4 protein, a polypep- 
tide of the formula AA _ 2 3~ AA 182 of mature T4 protein, 
followed by the ami no acids asparagine-leucine- 
glutamine-histidine-serine-leucine, a polypeptide of 
the formula AA 1 " AA 18 2 of mature T4 protein, followed 
by the amino acids asparagine-leucine-glutamine- 
histidine-serine-leucine, a polypeptide of the formula 
Met-AA 1-182 of mature T4 protein, followed by the 
amino acids aspaxagine^leucine-glutamine-histidine- 
serine- leucine, a polypeptide of the formula 
AA_ 23 -AA 113 of mature T4 protein, a polypeptide of 
the formula AA X -AA 113 of mature T4 protein, a polypep- 
tide of the formula Met " AA 1 . 113 of mature T4 protein, 
a polypeptide of the formula AA^S'^lll of ma ture 
T4 protein, a polypeptide of the formula AA 1 -AA 111 
of mature T4 protein, a polypeptide of the formula 
Met-AA lwlll of mature T4 protein, a polypeptide of 
the formula AA_ 23 -AA 131 of mature T4 protein, a poly- 
peptide of the formula AA 1 " AA 131 of mature T4 protein, 
a polypeptide of the formula Met " AA 1 _ 131 of mature 

T4 protein, a polypeptide of the formula AA „-AA„ MF 

-23 145 

of mature T4 protein, a polypeptide of the formula 
AA 1 -AA 145 of mature T4 protein, a polypeptide of the 
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f nnula Met-AA i-i45 o£ mature T4 protein, a polypep- 
tide of the formula AA . 2 3"" AA 166 of mat:ure T4 protein, 

a polypeptide of the formula AA, -AA, of mature T4 

i. loo 

protein, a polypeptide of the formula Met-AA^^^^ of 
5 mature T4 protein, or portions thereof. 

12. A method for producing a polypeptide 
selected from the group consisting of the polypeptides 
of any one of claims 7 to 11 comprising the step of 
culturing a unicellular host transformed with a recom- 
10 binant DNA molecule selected from the group consisting 
of the recombinant DNA molecules of claim 3 or 4. 



13 . A pharmaceutical composition comprising 
an immuno therapeutic or immunosuppressive effective 
amount of a polypeptide selected from the group con- 

15 sisting of the polypeptides of any one of claims 7 to 
11 and a pharmaceutical ly acceptable carrier. 

14. A method for treating patients com- 
prising the step of treating them in a pharmaceuti- 
cal^ acceptable manner with a composition selected 
from the group consisting of the composition of 
claim 13. 



20 



15. The method according to claim 14, 
wherein the patient is treated by intramuscular 
injection of the composition. 

25 16. A diagnostic composition for detecting 

or- for monitoring the course of HIV infection com- 
prising a diagnostic effective amount of a polypeptide 
selected from the group consisting of the polypeptides 
of any one of claims 7 to 11. 

30 17. A method for detecting or for moni- 

toring the course of HIV infection comprising the 
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step of employing as a diagnosti - a composition 
selected from the group consisting oZ the compositions 
of claim 16. 

18. A means for detecting or for monitoring 
the course of HIV infection comprising a composition 
selected from the group consisting of the compositions 
of claim 16. 

19. A pharmaceutical composition compris- 
ing an immunotherapeutic or immunosuppressive effec- 
tive amount of antibody to a polypeptide selected 
from the group consisting of the polypeptides of any 
one of claims 7 to 11 and a pharmaceutical ly accept- 
able carrier. 

20. A method for treating patients com- 
prising the step of treating them in a pharmaceutic 
cally acceptable manner with a composition according 
to claim 19. 

21. The use of a polypeptide selected 
from the group consisting of the polypeptides of any 
one of claims 7 to 11 to purify HIV virus. 

22. The use according to claim 20 , wherein 
the HIV virus is purified from a biological sample. 

23. A method for purifying HIV virus from 
_ a_ sample comprising the^ step of _expo_sing the J»nplf_ 

to a polypeptide selected from the group consisting 
of the polypeptides of any one of claims 7 to 11. 

24. The method according to claim 22, 
wherein the sample is a biological sample. 
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25- A DNA sequence com- rising the DNA 
insert: of pl70-2, said sequence c dine, on expression 
for a T4-like polypeptide . 

26. A recombinant DNA molecule comprising 
5 a DNA sequence selected from the group consisting of 

the DNA sequence of claim 25, said DNA sequence 
being operatively linked to an expression control 
sequence in said recombinant DNA molecule. 

27. A unicellular host transformed with a 
10 recombinant DNA molecule according to claim 26. 

28. A polypeptide coded for on expression 
by a DNA sequence of claim 25, said polypeptide being 
essentially free of other proteins of human origin. 

29. A pharmaceutical composition comprising 
15 an immunotherapeutic or immunosuppressive amount of a 

soluble protein receptor and a pharmaceutical ly 
acceptable carrier . 

30. A method for treating patients 
comprising the step of treating them in a pharma- 

20 ceutically acceptable manner with a pharmaceutical 
composition of claim 29. 

31. A diagnostic composition for detecting 
or for monitoring the course of viral infection com- 
prising a- diagnostic effective amount of a soluble 

25 protein receptor. 

32. A method for detecting or for moni- 
toring the course of a viral infection comprising 
the step of employing as a diagnostic a diagnostic 
effective amount of a soluble protein receptor. 

3753 

89085519 



-104- 

33. A means for detecting or for moni- 
toring the course of a viral infecticn comprising 
a soluble protein receptor. 

34. A DNA sequence selected from the group 
consisting of: 

(a) the DNA insert of pBiv.l; 

(b) DNA sequences which hybridize to 
the DNA insert of pBiv.l and which code on expression 
for a polyvalent soluble T4-like polypeptide; and 

(c) DNA sequences which code on 
expression for a polyvalent soluble T4-like polypep- 
tide coded for by the DNA insert of pBiv.l. 

35. A recombinant DNA molecule comprising 
a DNA sequence selected from the group consisting of 
the DNA sequences of claim 34, said DNA sequence 
being operatively linked to an expression control 
sequence in said recombinant DNA molecule. 

36. A unicellular host transformed with a 
recombinant DNA molecule according to claim 35. 

37. A polypeptide coded for on expression 
by a DNA sequence selected from the group consisting 
of the DNA sequences according to claim 34, said 
polypeptide being essentially free of other proteins 
of human origin. 

38. The polypeptide according to claim 7, 
wherein said polypeptide is polyvalent. 

39. A method for producing a polyvalent 
polypeptide comprising the steps of: 

(a) culturing a unicellular host 
transformed with a recombinant DNA molecule according 
to claim 3 or 4 to produce a polypeptide; and 
754 
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(b) coupling saia polypeptid to a 
carrier to form a polyvalent polypeptide. 

40. A DNA sequence comprising: 

(a) a first portion comprising a DNA 
sequence coding for the constant region of an immuno- 
globulin light chain; and 

(b) a second portion comprising a 
DNA sequence according to claim i or ?, or portions 
thereof , said second portion being joined upstream 
of said first portion. 

41. A DNA sequence comprising: 

(a) a first portion comprising a DNA 
sequence coding for the constant region of an immuno- 
globulin heavy chain; and 

(b) a second portion comprising a 
DNA sequence according to claim 1 or 2 , or portions 
thereof, said second portion being joined upstream 
of said first portion. 

42. An expression vector comprising the 
DNA sequence according to claim 40. 

43. An expression vector comprising the 
DNA sequence according to claim 41. 

44. An expression vector comprising the 
DNA sequence according to claim 40 and the DNA 
sequence according- to claim 41. — — 

45. A method for producing a chimeric 
rsT4/IgG 1 comprising the step of co- trans fecting a 
host cell with the expression vector according to 
claim 42 and the expression vector according to 
claim 43 . 
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46. A method for producing a chimeric 
rsT4/IgG 1 comprising the step of transfecting a host 
cell with the expression vector according t claim 44. 

47. A chimeric rsT4/IgG 1 produced by the 
method according to claim 45 or 46. 



48. A pharmaceutical composition comprising 
an immunotherapeutic or immunosuppressive effective 
amount of a polypeptide according to claim 37 or 38. 

49. A method for treating patients com- 
prising the step of treating them in a pharmaceutic 
cally acceptable manner with a composition according 
to claim 48. 

50. A diagnostic composition for detecting 
or for monitoring the course of HIV infection com* 
prising a diagnostic effective amount of a polypeptide 
according to claim 37 or 38. 

51. A pharmaceutical composition comprising 
an immunotherapeutic or immunosuppressive effective 
amount of a chimeric rsT4/IgG 1 according to claim 47. 

52. A method for treating patients com- 
prising, the step of treating them in a pharmaceuti- 
cal^ acceptable manner with a composition according 
to claim 51. 
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Sfil 
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5* CTC CCC ACA ?CC TCC ACC CCC GTG CAC CCA ATC TCA 3' 



49 

5' CAT CTC ACA TTG OCT CCA CCC CCC TCC ACC ATC T 3' 



50 

5' TCC ACC CCC CTC CAC CCA ATC CCC CTC ATT TCA 3' 

3' CO CCC CAC CTC OCT TAC CCC CAC TAA ACT CTA C 5' 

51 
41 

5' CAA CAA CCT TCT CCC ACC AAC 3' 
34 

5 • TCC CCA CAC CTC CTC CTC CAA TCC AAC ATC AAC- TCA A 3 • 

3* CTC TCC ACC ACC ACC TTA CCT TCT ACT TCA CTT CTA C 5* 

35, 

SS — - - — - - - ■ — — 

5' CAC CCA CCC AAC CAA ACA AAC TCC 3* 

46 

S- TCC CCA CAC CTC CTC CTC CAA TCC AAC ATC AAC 
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5' CCC CAC AAC CTT CAT CTT CCA TTC CAC CAC CAC CTC TC 3' 
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« >«a3q 
X|2IA1 

CAATTCTTACACTTACTTAAATTGCTAATrTrAT^^ 



1 ♦ ♦ ♦ ♦ * CO 

CTTAA&AATCTC2AATGAA1TTAAC C ATTAAAATATCTAATCTTTTCAATCCTTTAGC TAG 

CluPhaLauHiaLauValLy«LauLauIlaL«utadXl*T*rLy«L«uArgLy«S«Tlla - 

o c * 
k o m 

X X x 
TGGATCCAXXSCAAACAAACTC^TCCTCOCCAAAAAAgC 

A C CTACCTCC l ITilAC CAC G ACCC U 1 1 iniCZ c^TCT^CTTGACTCgA^ "° 
Tr7^^1aClyAjaLy«V«lValX^uGlyty«Ly«CXyA«p7^zVal61ar«a7&rCys - 

a mar km i 

I b b a 
a o o t 
1 * 3 1 

ACX Ul r T CCC AfSAACJU C AC C A T JtCJUl^^ 
121 — ♦ ♦ — * - — ♦ ISO 

T c rc s juus o* ivi^rit,it^ tA TcrtAACCTGx ccx : r + lu Ag urruc tisxi :m :aa 
T^JUAS«xClaLyttLy«S«rZl«01aPh«SisTrpLyaJUa5«rXaaciaXl«X.y«Xl« - 

• s 

mm* r juj 

laps o ynX 
aalp k as a 

4333 X 241 

// // 

CTCCOAAATCACCC 

HI ^ — ■ ■ — — — o 240 

GA CCCT rTJU S T CCCGA CGAACAATTCA «11C ACTXXCT*CCGCCAC?C 

:^aGlyJ^aOXaCXy3«r?h«I*uTtoLysGXyProSarLyaL«aA*&AjpAr9AlAA*p - 

* S 5 1 

LA XAtfaSS BXDXa i A 

a X bvXala cbpda a t 

A a OMMr loaa3 f X 

3 X 2344XX XXX2A 1 2 

/// / // 

TCAAfiAACAAC C JiU I t ^ »C A CC AAgGJUUlCTtTCCCCTGATeAtCAAX^ 

241 IZ r ^ r rZ I 'I^aA ^^ * 300 

S«rAr9Ar^«rL«uTrpA«pOlaGXyA0aPhaProL«aXX«Il<Ly«A«aL«aLy«Il« - 
H S 

ID M M WUM3 M 

ad b a nvuni a 

t • o X U51a • 

II 2 X 124XX 

«ACACTCACATACTrACATCTCTtU^ 

30X * ♦ ♦ * ♦ 3*0 

C :TC l U ACTCTATGAATGTACACACTTCACCrCCTCG IC r iCC rCC T C CACGTTAACSAT 

GlttA£p3«rAApTZirTyTXX*CyaGXuVAlGXttA«pGlatysGludluValGlaL uLau - 

782 RHUUH^iQ 
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C p 
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B X 

GTGTTCGGATrCACTGCCAACTCTCA^ 

3C1 ♦ * ♦ ♦ * * *20 

OlCAAGCCTAACTCACCGTrCAgACTGTGGGTGC^ 

aar s a 

s BBBCHC d m i s 

r «pco«r d a a r 

y oimtpF • 1 t y 

1 221221 1 1 11 

/ /// 

accttcgagagcccccctggtactagcccctcactgcaa^ 
421 * * * "— ^ ♦r— "I 480 

TCGAACCTCTCGGGGCCACCATCATCGCCCAGTCACCTTACA?CC^^ 
-*i r t^uGlttSerPro?roClys«rS«rProS«rValGlaCysArgS«r?roAr5GlyLy» - 

M 38ER SB 3ES 

HMD AaP A Bs«C5*Sc«S 3 M«cc 
b ad lpv 1 aptoxsam * Iror 
o 1 • oBu a alHRApcFXt a anr? 
2 11 122 1 2212121111 1 4121 
// /////// / / 

AACATACA GG GGGCG»ACA r f C rC I C C STG7CT G ACCTGGAGCTCCAGCATAGTGGCACC 

AMI ♦ ♦ 340 

TTqCATGTCCC UCCC I I C l OG GACAGGCACAGACTCGACCrCGAGGTCCTATCACCGTGG 

AjaIl«GlaGlyGlyLysT!irI^aS«rVals«rGlaLaaGlaI^uGlaAapSerGlyT!ir * 
M 

MB M H 

lp b h 

*H o • 

31 2 1 

TGCACATGO UITC r C TT GC ACAA C CXCAAGAAGCTCGAGTTCAAAATAGACATC 
341 ACCTCTACGTGXCACAJ U1G T C IT OU IC I IX. 1 ' lll CACCTCAAJ U ri ' l ' lA TCTCTAGCA C CA C 
TrpnrCy«T!irVAlZ^aGlnAflaGlaI.ysLysValGluPbtt£.ytIl«AspIlttValVal - 

MA HS MM 

a 1 at a a 

• u «u 11 

_ 1 _1 _ 31 1 .1 _ . - - 

/ 

C7AG C: 1 : It CAGAAGGCCTCCAGCATAGTCTATAAGAAAGAGGGGCAACXGGTGGACTTC 

601 „— — ♦ ♦ ♦ - 660 

GATCGAAAGGTCTTCCGGAGGTCGTATCACIATATTCTtTCTCC C CCTTGTCCACCTCAAG 

LatiAia?h«GlaI.y«AlAS«rSAr:ieVairyrLy«Ly«GluGlyCluGlaValGlu?i:* - 

A AM 

1 la 
u a 1 
1 1 1 

TCC7TCCCACTCGCCTTTACAC7TGAAAAGCTGACGGGCACTGGCGAG C -TGTGCTGGCAG 
6fil ♦ — ♦ ♦ 720 

AC^IAAX^T G AGCGGAAATGTCAACT^ 

SarPh ProL«uAlaPbeTbrValCluLy«LauT!xfGlyS«rGlyGluL«uTrpTrpGln - 

3? 83 s.qnxs^iQ 
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MH K* MNDa M 
np al bdpu b 
1& LM «n3 o 
II 11 121A 2 

/ / / 

CCCGACXCCCCTT CC ? CC r C CAACTCTrGGA?CACCrr?GACC7^^ 

721 * * * * ♦ ♦ 73C 

CGCCTCTCCCGAAGCAGCAGCTTCAGAACCTAGTGGAAACTGCA ^^ :u Jill -MU lC 

AlaGluArsAlJLSorS«rSorLy«s«rTrpIl«?::rPhe^ 

* BCS ? S 

•Iff SCC AOMpPOtS A AH 

zm, tor vrluadui 1 1 p 

Zm HUT aa&Hj*9a q o & 

23 121 22411K1 1 i i 

/ . / / // // 

TCT«JuaAr r yy r rrAcc c AGCA CC CT 

7il ♦ ♦ ♦ ♦ ♦ 840 

AGACATTTTGCCC^TGGGTCCTCCGATTCGAGGTC^ 

S«rV*lLy«ArgV*lT^rClaA«pProLy»L«uGlnM«tClyLy»Ly«L«ttProL«ttHi« - 

bzs acs s 

N see H5 0 M H see KKa 

n tor at 4 a P - cor iua 

1 HRT oa • 1 h KKJT 1«9 

X 121 31 1 1 i 121 13« 

/ / / / 

CTCACCCTGCCCCAGGCCTXCCCTOUrrATCC^^ 

•41 ♦ * — — ♦ ♦ ♦ 900 

GAGTGGGACGGGGTCCGGAACGGAGTCATACGACCCAGACCTrTGGAG^ 

I^uThrl^uProClaAlal^ttJ^rcKUaTyr^ - 

S BCS 

t «CC BO A 

• tor p d 1 

V m ho o 

1 121 111 

GAAGCGAAAACAGGAAAgTTGCXTCAGGJUUrTGAACCTGGTGCT 

901 — ♦ ♦ — ♦ - ♦ ♦ 9(0 

CTTCGllTl'TUlU.i illAACCTACT CC T'r CA CrTGGACCACCACTACrCTCGGTGACTC 

GluAl«XysT2irClyLyaLottHl«GlnGluVAlA«aLouV«lV«lM«tAr9AlAT&rGla - 

PS S 

M ADNNpPaS f D MA D 

n vrliuaui i d nl d 

j. aaaaMa*n M — • la • 

1 22441141 1 1 11 1 

/////// 

CTCCJUIAAAAArrrGACCTCTCUCGTGTGGGGACC 



961 



1020 



GA^TCTTTTTAAACTGGACACTC CACACC C CTGGGTGGAGGGGATTC GACTAC GACTCA 
LauGlaLy«A*nl^uThrCy»GluV«lTrpGlyProTbrS«rProLy«LauMotL«uS«r - 

8908^519 

a • a 

1 q 1 

3784 11 1 

ttouuu:tggagaac\ac^ggcaaagctctcc 

X021 * * * ♦ ♦ ~ 1080 

aactttgacctctt g ttcctccgtttccagagcttcgccctcttccgccacacccacg^ 

LauLysLduGluAjaLysGluAlaLytValSflrty«ArgGluLysAl«74iTrpV4lLott - 
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H PS K 

OH f 0 M 1 X ADpPftS i 

6* d 4 n v vru«ui n 

•t ft ft f ft 4ftMft9a f 

12 113 11 221161 1 

/ // // 

AACCCTGAGGCGGG^TCTGGCAGTCTCTGCTCX^^ 
CTGGGACTCCGCCCCTACACCraACA ra 

AftnFroClttAlaGlyM«tTrpCinCytt«uL«oS«rAftpSerClyclaV«lXauL«u^ltt - 

h s r 5 F 

HSA1T KKMC a BMDNftJC *XP 

lacaa p«cr a btopduh u« 

alccq apiF 4 voa«3o 4t 

31121 2111 H 1112A2 HI 

TCCAACATCAAGGTTtrrCCCCACATC^ 

♦1* 1 — — — ♦ — ^ 1200 

AGGTTGTACTTCCAACAC G CCTCTACCA^ "°° 

s«rAftaIl«LysV4li^uProThrTrpS«rTtoProv*lGla^oM«^^p?roXl4 - 
PS ps s^ 0 ^ 

A MB ADHHpKPftS ADNpPaS M5ID*X 
1 ab vt11msu± vrlasal bepuh 
a lv u«aM«i)a ft*«Mftfta o«a3o 
1 11 224411161 2241161 I21A2 

// // / // / //// / // 

CCCCAgCTrG GGC A CCC TAra 1 1 A TTTTCJUirrGGgACATCCt^TrCT 

CGGGTCGAACCCCTGGGATCTCCAGGGGJUUUU^ 1260 

XlaClaL«uClyA«pProXrgGlyProL«uPh#TyrPh«GlttX^uClyA«pProA«aS«r - 

'Clal 

M A CT AMtf A 

1 1 14 l*h 1 

ft « m nmm o 

3 1 If 111 l 

/ 

CATGTTTCACACCTrXTCATCCATAAGCTAG^ 

1261 ~?rr- * * * ♦ * 1320 

CrACAAACTGTCGAAXXGTACCTATTCGATC GAAATTAC CC CATCAAA^ACTCTCAATTT 

HisV«l£ndGlaL«aXl«Xl«AftpLyftt^uAl«L«uM«rArgSadPhttIlftThrValXys . 

. . 89085519 , S , 

* 1 o— a h m - 

» • k P • 5a 

TTGCTAACCCAG7CAGCCXCCCTGTATGAAA 

AACGATTCCCTCAC?CCCTGCCACATACTTTJU1ATTC~ 

L«ul^uThrClaS«rClyThrV*lTyrCluI leEaeciaCy ■AlaHiftArgHi.ProArg - 

S BES s s 

K MMf SCC F KM R XMHaHc 

1 aoa tor o Pft ft pftcuar 

4 ftlN MR? - * ap 4 IipiSftF 

♦ 311 121 1 21 1 211631 

// / / /// / 

CACCGTC^CCTC^TGCTCrTAa^ 

1341 * * * * 1440 

J785 CTGGCAGTCXX^CC-AeGACATCeSTATCCCaUU: 

HiftApgKiftProGlyCyftCyftArgHiftArg^euGlyCyrAIaGlyTlirAlaGlyProL u - 
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M e KBf a KM i HH5 

a atoa u ha a haa 

1 * 4 «• P «*M 

IV 311 H 11 1 121 

/ 

GCGGCAIATCGTCC*TTCCGACAGCATCGCCAGTCACTATG« 

1441 — * ♦ * - - 1300 

CGCCCTATJUXJU^AAGGCTGTCGTAGCGGTCAG^ 

XiaClyTyrArgProPhaAr^GlaKiaAr^OlaSarLattTrT^Ar^xiaXlaSarAlall* - 

H BB r 

irat »9$f c an 

aaha pxa f tu 

Ppar lAp r «4 

1111 212 1 3H 

/ // 

TGCGTTGATGCA AlVlt-^ TGCGCACCCGTTCT C GGAGCACT^ 

1301 * * ♦ * 1360 

ACGCAACTACGTTAAACXTA CG CGTGGGCXjUSXGCCTCGTGACAC 

CysValA^pAlaZlttSarMacArgThrAxgSarAx^s^rTtirVAlAr^ProLatt'rrpPro - 

F F S 

a If T niCfDaT H 

a 1 * ubdpafc 1 

4 a q Doaa3a a 

H 4 1 212DU 3 

/ // 

CCGCCCA U rU-IUIZCGCI :CGCTACrTOgAGCCACSATCCACTXCGCGATCAT<»CGAC 

1341 — ♦ ♦ ♦ ♦ ♦ ♦ 1S20 

GCCCCCTCAGGACC AftCTiA Ary^TCAACCTCCCTGATAGCTCATCCGCTACTACCCCTC 

ProyroSarProAaaJ^PhmJUaTteTrpSarHlaTyrAr^LaqArgXapHisGlyAap - 

H S H H 

i HM CEHHEKHf HMEX9UBHIN 

a pa fgappaaa paaiixfrr>ana 

1 ap raaahpaH apaaDaMPr 

1 21 11321111 2112111211 

/ / //// / / /// 

CACXCCC U Xt:C I C T C G Xr ? C TC rA CCCCGGJU:GCXTCGTGGCCGGCATCACCGGCSCCXC 

1621 ♦ ♦ ♦ * - * 1«80 

CTGTGGGCAGGACXCCTAAfiJUSATGCCGCCTGCGTAG^ 

HisThrArgProValAjpSarl^uArt^^ - 
s k a s a 

Mf a A<jKBH 1 HW a - - . MNOa 3iMM 

la ahifrhanal p bdpu apba 

&K oaDaa*Pra a oaa3 alop 

41 121112114 1 121A 2222 

// / //// / / /// 

AGCTGCG U f T CC ri^ gGCCTXtATCGCCGACX?CACCCIXTGGCGAJU^TCGCCC?CCCCX 

1681 *~ * * * * * 1740 

TCCXCCCCAACGACCGCGGAXXTXGCGGCTGTAGTGGCTACCCCTTCTAGCCCGAXSCGGT 

AxgCyaGlyCyaTrpAxgl^uTyrAxgArgHlaaiaAxgTrpGlyXrgSarGlySarPro - 

a a ss 

BaK N i SB DHJfPa C HKKHc 

apa la ha QQOQk^l Q ralau f apscr 

alp aPaa O3U0 J J13 aaaa9 r eapiP 

222 3 1 12 23416 1 32111 

// / /// 

CTTCGCGCTCXTGLXCCCCTTGTT?CGGCGTCGGTATGGTGGCACGCCCCGTCGCCGGGGG 

17 41 — ♦ — ♦ — * * — • — ♦ 18 00 

GAAGCCCGAG7AC7CGCGAACAAAGCCGCACCCA7ACCACCGTCCGGGGCACCGGCCCCC 

lauArgAdaHltGlcKf^LauPteAr^^lyTyrGlyGlyAQproArgGlyAr^ly • 
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X C T G TTGGCCCCCJVT C T CC TTGCACCCACCATTCCTTCCSGC 
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?^rV«lClyXr9Hi4L«uL^ttXlaArg?!irXl«ProCy«GlyClyGlyAlaGlsAr9?ro - 
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CAA C CTACTA C T CCCC rC C I ' iU CTAXTGCAC^AgrCCCXTJUUXXIJUIAGC G T CS TCCGAT 

1861 * * * * * * 1920 

CTTCCXTGJ a CJL CCa U C CAAGGACTA^ 

CiaProT&rT&rClytAuX^aProAaxULlAGlyVAlAlaEadGlyi^gXlAS^rS^rAA^ - 
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ccccTTGxgAC curitJ uurccxcrrcx ^ 

1921 ♦ — ♦ — — — ♦ ♦ * 1980 

CGGCAA C T C T CC CJUUgTGGGTCACTCGAGGAAGG C CAC CCGCGCC CC^ 

XlA£^ttGlaS«rI^aCl&ProS«rClaI^uI^ttPrbValGlyXl*GlyHi«A«pTyrAr9 - 

F T H 

n M M B H HHM a i BH 

o b 1 4 1 ps* o a bi 

4o a a « apa 4 ? aa 

B 2 3 14 211 B 1 12 

// 

CGCCGCACTTATGX C T C T C "C Z I rA TCATGCAACTCGTAGGACXCGTGCCCGCAGCGCr 

1981 ♦ ♦ * * ♦ - 2040 

GCGGCSTGAATXCTCACAGAACAAATAGTACGTTCAGCATCCT^ 

ArgArgTirtyrAapCyal^oi^uTyrHiaAlAThrArg^ - 
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CTGCCTC X ' rrr r CC G C GXGGACCG C :r:t. CCTCCAGCGCGACGXTGATCCGCCrCTCGCr 

2041 * ♦ * ♦ ♦ —.-—2100 

GACCCAGTAAAACCCCCT CC T G G C GAAfcGCCACCTC^ 

LauGlyBisPheArQXrgGlyProLAuSarLauGluArgXipA^pAApAr^ProValAlA - 
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TGCGGTATTCGGXA7CTTGGJLCGCCCTCGCTCUIGCC7TC 

3 7 872101 * ♦ — * ♦ ♦ 2160 

ACGCCATAXGCCTTAGAACCTGCGGGAGCGAGTTCGGAAGCAGTGACCAGGGCGCTGGTT 



CyBGlyllAArcXAaLauXlaArgProArgSerSerLauArcHIsrrpSarArgHisGla - 
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PBG39S 



BG368 backoont C //J /^T 

SOluDlt T4#3 r IkJ 

AA #3 = LYS 

Dg381.se0 Length: 61S1 



3802 



1 


GAATTAATTC 


CAGCTTGCTG 


TGGAATGTGT 


GTCAGTTAGG 


GTGTGGAAAG 


5 1 


TCCCCACCCT 


CCCCAGCAGG 


CAGAAGTATG 


CAAAGCATGC 


ATCTCAATTA 


10 1 


CTCAGCAACC 


AGGTGTGGAA 


AGTCCCCAGG 


CTCCCCAGCA 


GGCAGAAGTA 


151 


TGCAAACCAT 


GCATCTCAAT 


TAGTCAGCAA 


CCATAGTCCC 


GCCCCTAACT 


201 


CCCCCCATCC 


CGCCCCTAAC 


TCCGCCCAGT 


TCCGCCCATT 


CTCCGCCCCA 


251 


TGGCTGACTA 


ATTTTTTTTA 


TTTATGCAGA 


GGCCGAGGCC 


GCCTCGGCCT 


301 


CTGAGCTATT 


CCAGAAGTAG 


TGAGGAGGCT 


TTTTTGGAGG 


GGTCCTCCTC 


351 


GTATAGAAAC 


TCGGACCACT 


CTGAGACGAA 


GGCTCGCGTC 


CAGGCCAGCA 


401 


CGAAGGAGGC 


TAAGTGGGAG 


GGGTAGCGGT 


CGTTGTCCAC 


TAGGGGGTCC 


451 


ACTCGCTCCA 


GGGTGTGAAG 


ACACATGTCG 


CCCTCTTCGG 


CATCAAGGAA 


501 


GGTGATTGGT 


TTATAGGTGT 


AGGCCACGTG 


ACCGGGTGTT 


CCTGAAGGGG 


551 


GGCTATAAAA 


GGGGGTGGGG 


GCGCGTTCGT 


CCTCACTCTC 


TTCCGCATCG 


601 


CTGTCTGCGA 


GGGCCAGCTG 


TTGGGCTCGC 


GGTTGAGGAC 


AAACTCTTCG 


651 


CGGTCTTTCC 


AGTACTCTTG 


GATCGGAAAC 


CCGTCGGCCT 


CCGAACGGTA 


701 


CTCCGCCACC 


GAGGGACCTG 


AGCGAGTCCG 


CATCGACCGG 


ATCGGAAAAC 


751 


CTCTCGAGAA 


AGGCGTCTAA 


CCAGTCACAG 


TCGCAAGGTA 


GGCTGAGCAC 


801 


CGTGGCGGGC 


GGCAGCGGGT 


GGCGGTCGGG 


GTTGTTTCTG 


GCGGAGGTGC 


851 


TGCTGATGAT 


GTAATTAAAG 


TAGGCGGTCT 


TGAGACGGCG 


GATGGTCGAG 


901 


GTGAGGTGTG 


GCAGGCTTGA 


GATCGATCTG 


GCCATACACT 


TGAGTGACAA 


951 


TGACATGCAC 


TTTGCCTTTC 


TCTCCACAGG 


TGTCCACTCC 


CAGGTCCAAC 


1001 


TGGATCCAAG 


CTTCGACTCG 


AGGAATTCCC 


CGAAGGAACA 


AAGCACCCTC 


1051 


CCCACTGGGC 


TCCTGGTTGC 


AGAGCTCCAA 


GTCCTCACAC 


AGATACGCCT 


HOI 


GTTTGAGAAG 


CAGCGGGCAA 


GAAAGACGCA 


AGCCCAGAGG 


CCCTGCCATT 


1151 


TCTGTGGGCT 


CAGGTCCCTA 


CTGGCTCAGG 


CCCCTGCCTC 


CCTCGGCAAG 


1201 


GCCACAATGA 


ACCGGGGAGT 


CCCTTTTAGG 


CACTTGCTTC 


TGGTGCTGCA 


1251 


ACTGGCGCTC 


CTCCCAGCAG 


CCACTCAGGG 


AAAGAAAGTG 


GTGCTGGGCA 


1301 


AAAAAGGGGA TACAGTGGAA 


CTGACCTGTA 


CAGCTTCCCA 


GAAGAAGAGC 


1351 


AT AC A ATTCC 


ACTGGAAAAA 


CTCCAACCAG 


ATAAAGATTC 


TGGGAAATCA 


i4u: 


GGGCTCCTTC 


TTAACVA4G 0 GTCC>XCCAA 


GCTGAATGAT 


CGCGCTGACT 
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1451 


CAAGAAGAAG 


1501 


CTTAAGATAG 


1551 


GGAGGAGGTG 


1601 


ACCTGCTTCA 


1651 


AGTAGCCCCT 


1701 


GGGGAAGACC 


1751 


GGACATGCAC 


1601 


ATCGTGGTGC 


1651 


GGGGGAACAG 


1901 


TGACGGGCAG 


195. 1 


AAGTCTTGGA 


2001 


GGTTACCCAG 


2051 


TCACCCTGCC 


2101 


CTGGCCCTTG 


2151 


GGTGATGAGA 


2201 


GACCCACCTC 


2251 


GCAAAGGTCT 


2301 


GGGGATGTGG 


2351 


CCAACATCAA 


2401 


CTGATTTGAG 


2451 


TGGACAAACT 


2501 


TTAAGTGTAT 


2551 


GA-TTCCAACC 


2601 


ATGAGGAAAA 


~ 2651 


GCTACTGCTG 


2701 


AGAAGACCCC 


2751 


CTGTGTTTAG 


2B01 


GaaaaaGCTG 


2651 


CTTTATAAGT 


2901 


CTCCACACAG 


2951 


TGTACCTTTA 


3001 


GTATAGTGCC 


3051 


GTT^TACTTG 



FIG. l5(con1 

CTTGTGGGAC CAAGGAAACT 
AAGACTCAGA TACTTACATC 
CAATTGCTAG TGTTCGGATT 
GGGGCAGAGC CTGACCCTGA 
CAGTGCAATG TAGGAGTCCA 
CTCTCCGTGT CTCAGCTGGA 
TGTCTTGCAG AACCAGAACA 
TAGCTTTCCA GAAGGCCTCC 
GTGGAGTTCT CCTTCCCACT 
TGGCGAGCTG TGGTGGCAGG 
TCACCTTTGA CCTGAAGAAC 
GACCCTAAGC TCCAGATGGG 
CCAGGCCTTG CCTCAGTATG 
AAGCGAAAAC aggaaagttg 

GCCACTCAGC TCCAGAAAAA 

CCCTAAGCTG atgctgagtt 
CGAAGCGGGA GAAGGCGGTG 
cagtgtctgc TGAGTGACTC 
GGTTCTGCCC ACATGGTCGA 
ATCTTTGTGA AGGAACCTTA 
ACCTACAGAG ATTTAAAGCT 
AATGTGTTAA ACTACTGATT 
TATGGAACTG ATGAATGGGA 
CCTGTTTTGC TCAGAAGAAA 
ACTCTCAACA TTCTACTCCT 
AAGGACTTTC CTTCAGAATT 
TAATAGAACT CTTGCTTGCT 
CACTGCT AT A CAAGAAAATT 
AGGCATAACA GTTATAATCA 
GCATAGAGTG TCTGCTATTA 
GCTTTTTAAT TTGTAAAGGG 
TTGACTAGAG ATCATAATCA 
C T T T A ftAg | ft) fcCff Cfi A C A C 



TfCCCCTGAT CATCAAGAAT 
TGTGAAGTGG AGGACCAGAA 
GACTGCCAAC TCTGACACCC 
CCTTGGAGAG CCCCCCTGGT 
AGGGGTAAAA ACATACAGGG 
GCTCCAGGAT AGTGGCACCT 
AGGTGGAGTT CAAAATAGAC 
AGCATAGTCT ATAAGAAAGA 
CGCCTTTACA GTTGAAAAGC 
CGGAGAGGGC TTCCTCCTCC 
AAGGAAGTGT CTGTAAAACG 
CAAGAAGCTC CCGCTCCACC 
CTGGCTCTGG AAACCTCACC 
CATCAGGAAG TGAACCTGGT 
TTTGACCTGT GAGGTGTGGG 
TGAAACTGGA GAACAAGGAG 
TGGGTGCTGA ACCCTGAGGC 
GGGACAGGTC CTGCTGGAAT 
CCCCGGTGCA GCCAATGGCC 
CTTCTGTGGT GTGACATAAT 
CTAAGGTAAA TATAAAATTT 
CTAATTGTTT GTGTATTTTA 
GCAGTGGTGG AATGCCTTTA 
TGCCATCTAG TGATGATGAG 
C C A A A A A A G A A G A G A A A GGT 
GCTAAGTTTT TTGAGTCATG 
TTGCTATTTA CACCACA AAG 
ATGGAAAAAT ATTCTGTAAC 
TAACATACTG TTTTTTCTTA 
ATAACTATGC TCAAAAATTG 
GTTAATAAGG AATATTTGAT 
GCCATACCAC ATTTGTAGAG 
CTCCQCCTGA ACCTGAAACA 
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FIS. 15 (cont'd) 

^ATTGTTG TTGTTAACTT GTT 



3X01 


TAAAATGAAT 


GCAATTGTTG 


TTGTTAACTT 


GTTTATTGCA 


GCTTATAATG 


3151 


GTT ACAAAT A 


AAGCAATAGC 


ATCACAAATT 


TCACAAATAA 


AGCATTTTTT 


3201 


TCACTGCATT 


CTAGTTGTGG 


TTTGTCCAAA 


CTCATCAATG 


TATCTTATCA 


3251 


TGTCTGGATC 


CTCTACGCCG 


GACGCATCGT 


GGCCGGCATC 


ACCGGCGCCA 


3301 


CAGGTGCGGT 


TGCTGGCGCC 


TATATCGCCG 


ACATCACCGA 


TGGGGAAGAT 


3351 


CGGGCTCGCC 


ACTTCGGGCT 


CATGAGCGCT 


TGTTTCGGCG 


TGGGTATGGT 


3401 


GGCAGGCCCG 


TGGCCGGGGG 


ACTGTTGGGC 


GCCATCTCCT 


TGCATGCACC 


3451 


ATTCCTTGCG 


GCGGCGGTGC 


TCAACGGCCT 


CAACCTACTA 


CTGGGCTGCT 


3501 


TCCTAATGCA 


GGAGTCGCAT 


AAGGGAGAGC 


GTCGACCGAT 


GCCCTTGAGA 


3551 


GCCTTCAACC 


CAGTCAGCTC 


CTTCCGGTGG 


GCGCGGGGCA 


TGACTATCGT 


3601 


CGCCGCACTT 


ATGACTGTCT 


TCTTTATCAT 


GCAACTCGTA 


GGACAGGTGC 


3651 


CGGCAGCGCT 


CTGGGTCATT 


TTCGGCGAGG 


ACCGCTTTCG 


CTGGAGCGCG 


3701 


ACGATGATCG 


GCCTGTCGCT 


TGCGGTATTC 


GGAATCTTGC 


ACGCCCTCGC 


3751 


TCAAGCCTTC 


GTCACTGGTC 


CCGCCACCAA 


ACGTTTCGGC 


GAGAAGCAGG 


3601 


CCATTATCGC 


CGGCATGGCG 


GCCGACGCGC 


TGGGCTACGT 


CTTGCTGGCG 


3851 


TTCGCGACGC 


GAGGCTGGAT 


GGCCTTCCCC 


ATTATGATTC 


TTCTCGCTTC 


3901 


CGGCGGCATC 


GGGATGCCCG 


CGTTGCAGGC 


CATGCTGTCC 


AGGCAGGTAG 


3951 


ATGACGACCA 


TCAGGGACAG 


CTTCAAGGAT 


CGCTCGCGGC 


TCTTACCAGC 


4001 


CTAACTTCGA 


TCACTGGACC 


GCTGATCGTC 


ACGGCGATTT 


ATGCCGCCTC 


4051 


GGCGAGCACA 


TGGAACGGGT 


TGGCATGGAT 


TGTAGGCGCC 


GCCCTATACC 


4 101 


TTGTCTGCCT 


CCCCGCGTTG 


CGTCGCGGTG 


CATGGAGCCG 


GGCCACCTCG 


4151 


ACCTGAATGG 


AAGCCGGCGG 


CACCTCGCTA 


ACGGATTCAC 


CACTCCAAGA 


4201 


ATTGGAGCCA 


ATCAATTCTT 


GCGGAGAACT 


GTGAATGCGC 


AAACCAACCC 


4251 


TTGGCAGAAC 


ATATCCATCG 


CGTCCGCCAT 


CTCCAGCAGC 


CGCACGCGGC 


4301 


GGA-TCTCGGG 


CCGCGTTGCT 


GGCGTTTTTC 


CATAGGCTCC- 


GCCCCCCTGA.. 


4351 


CGAGCATCAC 


A A A AATCGAC 


GCTCAAGTCA 


GAGGTGGCGA 


AACCCGACAG 


4401 


GACTATAAAG 


ATACCAGGCG 


TTTCCCCCTG 


GAAGCTCCCT 


CGTGCGCTCT 


445 1 


CCTGTTCCGA 


CCCTGCCGCT 


TACCGGATAC 


CTGTCCGCCT 


TTCTCCCTTC 


45U 1 


UuuAAbLb 1 Li 


GCGCTTTCTC 


AATGCTCACG 






4551 


TGTAGGTCGT 


TCGCTCCAAG 


CTGGGCTGTG 


TGCACGAACC 


CCCCGTTCAG 


4601 


CCCGACCGCT 


GCGCCTTATC 


CGGTAACTAT 


CGTCTTGAGT 


CCAACCCGGT 


4651 


AAGACACGAC 


TTATCGCCAC 


TGGCAGCAGC 


CACTGGTAAC 


AGGATTAGCA 


470: 


GAGCGAGGTA 


TGTAC 8W*^ C 4 GAGT 


TCTTGAAGTG 


GTGGCCTAAC 
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FIG. 15 (cont'd) 



4751 


TACGGCTACA 


CTAGAAGGAC 


AGTATTTGGT 


ATCTGCGCTC 


TGCTGAAGCC 


4801 


AGTTACCTTC 


GGAAAAAGAG 


TTGGTAGCTC 


TTGATCCGGC 


A A AC A A ACCA 


4851 


CCGCTGGTAG 


CGGTGGTTTT 


TTTGTTTGCA 


AGCAGCAGAT 


TACGCGCAGA 


4901 


AAAAAAGGAT 


CTCAAGAAGA 


TCCTTTGATC 


TTTTCTACGG 


GGTCTGACGC 


4951 


TCAGTGGAAC 


GAAAACTCAC 


GTTAAGGGAT 


TTTGGTCATG 


AGATTATCAA 


5001 


AAAGGATCTT 


CACCTAGATC 


CTTTTAAATT 


AAAAATGAAG 


TTTTAAATCA 


5051 


ATCTAAAGTA 


TATATGAGTA 


AACTTGGTCT 


GACAGTTACC 


AATGCTTAAT 


5101 


CAGTGAGGCA 


CCTATCTCAG 


CGATCTGTCT 


ATTTCGTTCA 


TCCATAGTTG 


5151 


CCTGACTCCC 


CGTCGTGTAG 


ATAACTACGA 


TACGGGAGGG 


CTTACCATCT 


5201 


GGCCCCAGTG 


CTGCAATGAT 


ACCGCGAGAC 


CCACGCTCAC 


CGGCTCCAGA 


5251 


TTTATCAGCA 


ATAAACCAGC 


CAGCCGGAAG 


GGCCGAGCGC 


AGAAGTGGTC 


5301 


CTGCAACTTT 


ATCCGCCTCC 


ATCCAGTCTA 


TTAATTGTTG 


CCGGGAAGCT 


5351 


AGAGTAAGTA 


GTTCGCCAGT 


TAATAGTTTG 


CGCAACGTTG 


TTGCCATTGC 


5401 


TGCAGGCATC 


GTGGTGTCAC 


GCTCGTCGTT 


TGGTATGGCT 


TCATTCAGCT 


5451 


CCGGTTCCCA 


ACGATCAAGG 


CGAGTTACAT 


GATCCCCCAT 


GTTGTGCAAA 


5501 


AAAGCGGTTA 


GCTCCTTCGG 


TCCTCCGATC 


GTTGTCAGAA 


GTAAGTTGGC 


5551 


CGCAGTGTTA 


TCACTCATGG 


TTATGGCAGC 


ACTGCATAAT 


TCTCTTACTG 


5601 


TCATQCCATC 


CGTAAGATGC 


TTTTCTGTGA 


CTGGTGAGTA 


CTCAACCAAG 


5651 


TCAT7CTGAG 


AATAGTGTAT 


GCGGCGACCG 


AGTTGCTCTT 


GCCCGGCGTC 


5701 


AACAGGGGAT 


AATACCGCGC 


CACATAGCAG 


AACTTT A A A A 


GTGCTCATCA 


5751 


TTGGAAAACG 


TTCTTCGGGG 


CGAAAACTCT 


CAAGGATCTT 


ACCGCTGTTG 


5601 


AGATCCAGTT 


CGATGTAACC 


CACTCGTGCA 


CCCAACTGAT 


CTTCAGCATC 


5851 


TTTTACTTTC 


ACCAGCGTTT 


CTGGGTGAGC 


AAAAACAGGA 


AGGCAAAATG 


5901 


CCGC AAA A A A 


GGGAATAAGG 


GCGACACGGA 


AATGTTGA AT 


ACTCATACTC 


5951 


TTCCTTTTTC 


AATATTATTG 


A AGC ATTT AT 


CAGGGTTATT 


GTCTCATGAG 


6001 


CGGATACATA 


TTTGAATGTA 


TTTAGAAAAA 


T A A A C A A A T A 


GGGGTTCCGC 


605 1 


GCACATTTCC 


CCGAAAAGTG 


CCACCTGACG 


TCTAAGAAAC 


CATTATTATC 


6101 


ATGACATTA A 


CCTATAAAAA 


TAGGCGTATC 


ACGAGGCCCT 


TTCGTCTTCA 


6151 


A 


80085519 







3805 



CUSSTtTUTE SHEET 



50/93 



P3G392 

rBG366 backb n _ g _ 

: soluble T4#7 / [2 If? 

r AA #3 = LVS ' 9 t\~* 

: 182AA+6AA 

:from 203-5 

bg392.seq Length: 6149 

1 GAATTAATTC CAGCTTGCTG TGGAATGTGT GTCAGTTAGG GTGTGGAAAG 

51 TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 

101 GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA 

151 TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT 

201 CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT CTCCGCCCCA 

251 TGGCTGACTA ATTTTTTTTA TTTATGCAGA GGCCGAGGCC GCCTCGGCCT 

301 CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG GGTCCTCCTC 

351 GTATAGAAAC TCGGACCACT CTGAGACGAA GGCTCGCGTC CAGGCCAGCA 

401 CGAAGGAGGC TAAGTGGGAG GGGTAGCGGT CGTTGTCCAC TAGGGGGTCC 

451 ACTCGCTCCA GGGTGTGAAG ACACATGTCG CCCTCTTCGG CATCAAGGAA 

501 GGTGATTGGT TTATAGGTGT AGGCCACGTG ACCGGGTGTT CCTGAAGGGG 

551 GGCTATAAAA GGGGGTGGGG GCGCGTTCGT CCTCACTCTC TTCCGCATCG 

601 CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC GGTTGAGGAC AAACTCTTCG 

651 CGG T CTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 

701 CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 

751 CTCTCGAGAA AGGCGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC 

801 CGTGGCGGGC GGCAGCGGGT GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC 

851 TGCTGATGAT GTAATTAAAG TAGGCGGTCT TGAGACGGCG GATGGTCGAG 

901 GTGAGGTGTG GCAGGCTTGA GATCGATCTG GCCATACACT TGAGTGACAA 

1 _ 951 TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGGTCCAAC_ 

1G01 TGGATCCAAG CTTCGACTCG AGGAATTCCC CGAAGGAACA AAGCACCCTC 

1051 CCCACTGGGC TCCTGGTTGC AGAGCTCCAA GTCCTCACAC AGATACGCCT 

1101 GTTTGAGAAG CAGCGGGCAA GAAAGACGCA AGCCCAGAGG CCCTGCCATT 

1151 TCTGTGGGCT CAGGTCCCTA CTGGCTCAGG CCCCTGCCTC CCTCGGCAAG 
MET 

1201 gccac WatgU accggggagt cccttttagg cacttgcttc tggtgctgca 

Caa-23 

1251 ACTGGCGCTC CTCCCAGCAG CCACT^A§GG AAAGAAAGTG GTGCTGGGCA 
1301 AAAAAGGGGA TACAGTG ^^ {J^^^J^* CAGCTTCCCA GAAGAAGAGC 
135 1 ATACAATTCC ACTGGAAA A A CTCCAACCAG ATAAAGATTC TGGGAAATCA 



1401 


GGGCTCCTTC 


1451 


CAAGAAGAAG 


1501 


CTTAAGATAG 


1551 


GGAGGAGGTG 


1601 


ACCTGCTTCA 


1651 


AGTAGCCCCT 


1701 


GGGGAAGACC 


1751 


GGACATGCAC 


1801 


ATCGTGGTGC 


1851 


GGGAACAGGT 


1901 


ACGGGCAGTG 


1951 


GTCTTGGATC 


2001 


TTACCCAGGA 


2051 


ACCCTGCCCC 


2101 


GGCCCTTGAA 


2151 


TGATGAGAGC 


2201 


CCCACCTCCC 


2251 


AAAGGTCTCG 


2301 


GGATGTGGCA 


2351 


AACATCAAGG 


2401 


GATTTGAGAT 


2451 


GACAAACTAC 


2501 


AAGTGTATAA 


2551 


TTCCAACCTA 


2601 


GAGGAAAACC 


2651 


TACTGCTGAC 


2701 


' AAGACCCCAA 


2751 


GTGTTTAGTA 


2801 


AAAAGCTGCA 


2851 


TTATAAGTAG 


2901 


CCACACAGGC 


2951 


TACCTTTAGC 


3 8 0? 001 


ATAGTGCCTT 



5,/?3 F I G. l6(cont'd) 

AACTAAAG GTCCATCCAA GCTGA 



GCTGAATGAT CGCGCTGACT 



STOP 




GGGGT TAATAAGGAA TATTTGATGT 



:agc cataccacat ttgtagaggt 
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3808 



305 
310 
315 
320 
325 
330 
335 
340 
345 
350 
355 
360 
365 
370 
375 
360 
385 
390 
395 
400 
405 
410 
415 
420 
425 
430 
435 
440 
445 
450 
455 
460 
465 
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TTTACTTGCT 
AAATGAATGC 
TACAAATAAA 
ACTGCATTCT 
TCTGGATCCT 
GGTGCGGTTG 
GGCTCGCCAC 
CAGGCCCGTG 
TCCTTGCGGC 
CTAATGCAGG 
CTTCAACCCA 
CCGCACTfAT 
GCAGCGCTCT 
GATGATCGGC 
AAGCCTTCGT 
ATTATCGCCG 
CGCGACGCGA 
GCGGCATCGG 
GACGACCATQ 
AACTTCGATC 
CGAGCACATG 
GTCTGCCTCC 
CTGAATGGAA 
TGGAGCCAAT 
GGCAGAACAT" 
ATCTCGGGCC 
AGCATCACAA 
CTATAAAGAT 
TGTTCCGACC 
GAAGCGTGGC 
TAGGTCGTTC 
CGACCGCTGC 
GACACGACTT 



FIG. l6(cont'd) 

TTAAAAAACC TCCCACAC^T CCCCCTGAAC 
AATTGTTGTT GTTAACTTGT TTATTGCAGC 
GCAATAGCAT CACAAATTTC ACAA ATAAAG 
AGTTGTGGTT TGTCCAAACT CATCAATGTA 
CTACGCCGGA CGCATCGTGG CCGGCATCAC 
CTGGCGCCTA TATCGCCGAC ATCACCGATG 
TTCGGGCTCA TGAGCGCTTG TTTCGGCGTG 
GCCGGGGGAC TGTTGGGCGC CATCTCCTTG 
GGCGGTGCTC AACGGCCTCA ACCTACTACT 
AGTCGCATAA GGGAGAGCGT CGACCGATGC 
GTCAGCTCCT TCCGGTGGGC GCGGGGCATG 
GACTGTCTTC TTTATCATGC AACTCGTAGG 
GGGTCATTTT CGGCGAGGAC CGCTTTCGCT 
CTGTCGCTTG CGGTATTCGG AATCTTGCAC 
CACTGGTCCC GCCACCAAAC GTTTCGGCGA 
GCATGGCGGC CGACGCGCTG GGCTACGTCT 
GGCTGGATGG CCTTCCCCAT TATGATTCTT 
GATGCCCGCG TTGCAGGCCA TGCTGTCCAG 
AGGGACAGCT TCAAGGATCG CTCGCGGCTC 
ACTGGACCGC TGATCGTCAC GGCGATTTAT 
GAACGGGTTG GCATGGATTG TAGGCGCCGC 
CCGCGTTGCG TCGCGGTGCA TGGAGCCGGG 
GCCGGCGGCA CCTCGCTAAC GGATTCACCA 
CAATTCTTGC GGAGAACTGT GAATGCGCAA 
ATCCATCGCG TCCGCCATCT CCAGCAGCCG 
GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC 
AAATCGACGC TCAAGTCAGA GGTGGCGAAA 
ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG 
CTGCCGCTTA CCGGATACCT GTCCGCCTTT 
GCTTTCTCAA TGCTCACGCT GTAGGTATCT 
GCTCCAAGCT GGGCTGTGTG CACGAACCCC 

ac " j m&*>w ca tcttoaotcc 

ATCGCCACTG GCAGCAGCCA CTGGTAACAG 



CTGAAACATA 

TTATAATCGT 

CATTTTTTTC 

TCTTATCATG 

CGGCGCCACA 

GGGAAGATCG 

GGTATGGTGG 

CATGCACCAT 

GGGCTGCTTC 

CCTTGAGAGC 

ACTATCGTCG 

ACAGGTGCCG 

GGAGCGCGAC 

GCCCTCGCTC 

GAAGCAGGCC 

TGCTGGCGTT 

CTCGCTTCCG 

GCAGGTAGAT 

TTACCAGCCT 

GCCGCCTCGG 

CCTATACCTT 

CCACCTCGAC 

CTCCAAGAAT 

ACCAACCCTT 

CACGCGGCGC 

CCCCCTGACG 

CCCGACAGGA 

TGCGCTCTCC 

CTCCCTTCGG 

CAGTTCGGTG 

CCGTTCAGCC 

AACCCGGTAA 

GATTAGCAGA 
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FIG. 



l6(cont'd) 



4701 
4751 
4601 
4B51 
4901 
495X 
5001 
5051 
5101 
5151 
5201 
5251 
5301 
5351 
5401 
5451 
5501 
5551 
5601 
5651 
5701 
5751 
580-1 
5851 
5901 
5951 
6001 
6051 
6101 



GCGAGGTATG 


TAGGCGGTGC 


TACAGAGTTC 


TTGAAGTGGT 


GGCCTAACTA 


CGGCTACACT 


AGAAGGACAG 


TATTTGGTAT 


CTGCGCTCTG 


CTGAAGCCAG 


TTACCTTCGG 


AAAAAGAGTT 


GCTAGCTCTT 


GATCCGGCAA 


ACAAACCACC 


GCTGGTAGCG 


GTGGTTTTTT 


TGTTTGCAAG 


CAGCAGATTA 


CGCGCAGAAA 


AAAAGGATCT 


CAAGAAGATC 


CTTTGATCTT 


TTCTACGGGG 


TCTGACGCTC 


AGTGGAACGA 


aaactcacgt 


TAAGGGATTT 


TGGTCATGAG 


ATTATCAAAA 


AGGATCTTCA 


CCTAGATCCT 


TTTAAATTAA 


AAATGAAGTT 


TTAAATCAAT 


CTAAAGTATA 


tatgagtaaa 


CTTGGTCTGA 


CAGTTACCAA 


TGCTTAATCA 


GTGAGGCACC 


tatctcagcg 


ATCTGTCTAT 


TTCGTTCATC 


CATAGTTGCC 


TGACTCCCCG 


tcgtgtagat 


AACTACGATA 


CGGGAGGGCT 


TACCATCTGG 


CCCCAGTGCT 


GCAATGATAC 


CGCGAGACCC 


ACGCTCACCG 


GCTCCAGATT 


TATCAGCAAT 


AAACCAGCCA 


GCCGGAAGGG 


CCGAGCGCAG 


AAGTGGTCCT 


GCAACTTTAT 


CCGCCTCCAT 


CCAGTCTATT 


AATTGTTGCC 


GGGAAGCTAG 


agtaagtagt 


TCGCCAGTTA 


ATAGTTTGCG 


CAACGTTGTT 


GCCATTGCTG 


CAGGCATCGT 


GGTGTCACGC 


TCGTCGTTTG 


GTATGGCTTC 


ATTCAGCTCC 


GGTTCCCAAC 


GATCAAGGCG 


AGTTACATGA 


TCCCCCATGT 


TGTGCAAAAA 


AGCGGTTAGC 


TCCTTCGGTC 


CTCCGATCGT 


TGTCAGAAGT 


AAGTTGGCCG 


CAGTGTTATC 


ACTCATGGTT 


ATGGCAGCAC 


TGCATAATTC 


TCTTACTGTC 


ATGCCATCCG 


TAAGATGCTT 


TTCTGTGACT 


GGTGAGTACT 


CAACCAAGTC 


ATTCTGAGAA 


TAGTGTATGC 


GGCGACCGAG 


TTGCTCTTGC 


CCGGCGTCAA 


CACGGGATAA 


TACCGCGCCA 


CATAGCAGAA 


CTTTAAAAGT 


GCTCATCATT 


GGAAAACGTT 


CTTCGGGGCG 


AAAACTCTCA 


AGGATCTTAC 


CGCTGTTGAG 


ATCCAGTTCG 


ATGTAACCCA 


CTCGTGCACC 


CAACTGATCT 


TCAGCATCTT 


TTACTTTCAC 


CAGCGTTTCT 


GGGTGAGCAA 


AAACAGGAAG 


GCAAAATGCC 


GCAAAAAAGG 


GAATAAGGGC 


GACACGGAAA 


TGTTGAATAC 


TCATACTCTT 


CCTTTTTCAA 


TATTATTGAA 


GCATTTATCA 


GGGTTATTGT 


CTCATGAGCG 


GATACATATT 


TGAATGTATT 


TAGAAAAATA 


AACAAATAGG 


GGTTCCGCGC 


ACATTTCCCC 


GAAAAGTOCC 


AC£JG/\CGTC 


TAAGAAACCA 


TTATTATCAT 



GACATTAACC TATAAAAATA GGCGTATCAC GAGGCCCTTT CGTCTTCAA 
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WO 89/01940 
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V CIA OCT TTT CCA CTC A 3* 



5 ' FIG. 18 

5* CAT CTC ACT CCA AAC 3' 

63 

S' CCC CTC ATA CTA A3' 
64 

5' CAT CTT ACT ATC A 3' 

62 

S» CCC CCA GAG CCT CAC CCT CAC CTT CCA CAC CCC C 3' 



68 

5* CCC CCC CCC TCT CCA AGO TCA CCC TCA CCC TCT C 3' 

69 

S' CCS CCT ACT ACC CCC TCA CTC CAA TCA 3* 

70 

$• GAT CTC ATT CCA CTC AGG CCC TAC TAC 3* 



21 

5* CCS CCT ACT ACC CCC TCA CTC CAA TCT AGG ACT C 3 ' 



12 

3» TAC CAC TCC TAC ATT CCA CTG ACC CCC TAC TAC 3* 



73 

5* CTA CCC CTA AAA ACA TAC ACC CCC CCA AGA CCT CA 3' 



74 

' 5 ' CAT CTC ACC TCT TTC CCC CCC TCT ATC TTT TTA CCC 3 ' 

2S 

3' CCA CCA TAC TCC CAC CTC CAC ATC CAC TCT CTT CCA 
CAA CTC A 3' 

76 

5' GAT CTC ACT TCT CCA ACA CAC TCC ATC TCC ACC TCC 
CAC TAT CCT CCA CCT 3' 

" 38ii 89085519 
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BG368 oacko • .«■ / /O 

soluOle T4#5 * / W. / ^ 

A A #3 = LVS 

first 113 A A of T4 

oasical ly up to V1J1 

09394 . seq Length: 5365 



3812 



1 


uAA I I A A I 1 


r a r /** t t r*rr 


TuGAATGTG T 


GTCAGTTAGG 


GTGTGGAAAG 


5 1 




F*F*r*rAr*F*AF+F* 


^ A ^ A A 0* ^0> A 

CAGAAGTATG 


CAAAGCATGC 


ATCTCAATTA 


101 


' r t f* a r f* a » r*r 


AUuTuTuuAA 


AGTCCCCAGG 


0* V 0* 0* 0* M 0* 0* M 

CTCCCCAGCA 


GGCAGAAGTA 


1 C t 

lt> I 


TP.rAAiT.rAT 


r.rAtrTrAAT 

ulA 1 V- 1 LAA 1 


Ta^T^a^^a a 

tagtlagcaa 


F*F*A^AF*^F*F*F* 

CCATAGTCCC 


GCCCCTAACT 




rrr^rrrATrr 

ILuLLLA 1 LV. 


rr.rrrrTA »r 
uuum 1 A AC 


Tf*f*r*r*^^A f* ^ 
TLUGCtUAGT 


^ F* F* F* 0* 0* 0* A T T 

TCCGCCCATT 


M A A A M 4» A _ 

ctccgcccca 


^ c * 
L> * 




A TTTTTTTT A 

A I 1 1 1 I I I I A 


TTT" A ^ A ^ A 

TTTATGCAGA 


F* F* F* F* F* A F* F* F* F* 

GGwlGAGGCC 


0* 0+ ^0 0* 0* 0* 0* 0*^ 

GCCTCGGCCT 


JU : 


/- Tf.Ar.rT ATT 


F* r A CL A A f* T A F* 


t r* a f* f* a r* f* e*^F 
T GAGGAGGCT 


^^^t^ f* f* a ^~ <"» 
TTTTTGGAGG 


0* 0* ^0 0* 0*^0 0* 0*^0 0* 
GGTCCTCCTC 


•at 1 
1 


r.TAT Af.A A Af 
U 1 A 1 M M A V» 


Tfr.r.Arr att 

I (.UUAULAv. T 


c*r r a/* a ^* a a 
U TGAGACGAA 


0* 0* 0* ^ ^ ^ r* t 0* 

GGCTCGCGTC 


0* M. 0* F* 0* 0* A 0* 0* a. 

CAGGCCAGCA 


** V 1 


rr a A fin. A /HIP 


1 A A u 1 UUUAU 


uGGTAGuGGT 


CGTTGTCCAC 


^F A F* F* F* F* F* 0* 0* 

TAGGGGG T CC 




AC 1 Lul 1 V.LA 


r.r.r.Tr.Tr a a/" 
uuu 1 13 l uAAu 


a ^ a f* a ^ f* ^ f* f* 
ACACATGTCG 


CCCTCTTCGG 


0+ K ^ 0» K f* F* *. A 

CATCAAGGAA 








a r* F* F* F* tk f* f* 

AGGCCACGTG 


a f* r* ^ ^* ^ x 

ACCGGGTGTT 


F* F*^0 0* A A 0* 0* 0* 0* 

CCTGAAGGGG 


55 1 


^" f* T a T a aa a 
uul 1 A 1 AAAA 


F*f* , F*F*F*T , F*F-F*F* 


GCGCGTTCGT 


CCTCACTCTC 


TTCCGCATCG 


bU 1 


C T C T (~ T C F* f~ A 


F* F* F* F* F* a F* F* ^ F* 


TTGGGCTCGC 


GGTTGAGGAC 


AAACTCTTCG 




r r r rrTTTrr 


ACT A ^*T/*TTr* 

AUT AtTCTTU 


^ a *» 0* a a a ^* 

GATCGGAAAC 


0» 0* 0* ^ 0* 0* 0* 0* 0 m ^0 

CCGTCGGCCT 


CCGAACGGTA 


7 r. * 


rTrrT.rch.cr 


n a cnrL a r ct r 


a f* f* f* a f* t f* f* f* 
AGtGAGTCCG 


CATCGACCGG 


ATCGGAAAAC 


'31 




AuuLu I L. I AA 


f* f* a f* t f* a a ^* 
LUAGTCAUAb 


^ F* F* F* A A f* F* ^F A 

TlGCAAGGTA 


0* 0* 0* w mm * 0» 0* * 0* 

GGCTGAGCAC 


Owl 




ft r a r. r r.r^fi t 

UUlMuUUUU 1 


rLr~mrLT" men 


GT TUi I 1 uTb 


F* F* F* F* A F* F* *T F* F* 

UtGGAGGTGC 




Tr.fr.ATr.AT 


/"» T AATTAAAf* 
VJ 1 AA 1 1 AAAVJ 


T A F* F* F* F* F* T F* T 

TAGGCGGTCT 


^ 0* » 0* a F* F* F* F* F* 

TGAGACGGCG 


F* A ^FF* 0* ^0 0* 0* A 0* 

GATGGTCGAG 


9U1 


G T GAGGTGTG 


GCAGGCTTGA 


GATCGATCTG 


GCCATACACT 


TGAGTGACAA 


95 1 


TGAlATCCaC 


TTTGCCTTTC 


T CTCCACAGG 


-TGTCCACTCC 


CAGGTCCAAC 


1UU 1 


TGG-*CCAAG 


CTTCGACTCG 


AGGAATTCCC 


CGAAGGAACA 


AAGCACCCTC 


1051 


CCCACTGGGC 


TCCTGGTTGC 


AGAGCTCCAA 


GTCCTCACAC 


AGATACGCCT 


1101 


GTT T GAGA AG 


CAoCGGGCAA 


GAAAGACGCA 


AGCCCAGAGG 


CCCTGCCATT 


1151 


T C T G T GGGCT 


CAGGTCCCTA 


CTGGCTCAGG 


CCCCTGCCTC 


CCTCGGCAAG 


1201. 


GCCACAATGA 


ACCGGGGAGT 


CCCTTTTAGG 


CACTTGCTTC 


TGGTGCTGCA 


1251 


ACTGGCGCTC 


CTCCCAGCAG 


CCACTCAGGG 


AAAGAAAGTG 


GTGCTGGGCA 


130 1 


A A a A AGGGGA 


TACAGTGGAA 


CTGACCTGTA 


CAGCTTCCCA 


GAAGAAGAGC 


: 3i : 




ACTGfiAa*>OOTQC>4ACCAG 


ATAAAGATTC 


TGGGAAATCA 
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FIG. 19 (cont'd) 

140 1 oGGCTCCTTC TT A ACTA AAG GTCCATCCAA GCTGAATGAT CGCGCTGACT 

1451 CAAGAAGAAG CTTGTGGGAC CAAGGAAACT TTCCCCTGAT CATCAAGAAT 

1501 CTTAAGATAG AAGACTCAGA TACTTACATC TGTGAAGTGG AGGACCAGAA 

1551 GGAGGAGGTG CAATTGCTAG TGTTCGGATT GACTGCCAAC TCTGACACCC 

1601 ACCTGCTTCA GGGGTGATAG TAAGATCTTT GTGAAGGAAC CTTACTTCTG 

1651 TGGTGTGACA TAATTGGACA AACTACCTAC AGAGATTTAA AGCTCTAAGG 

1701 T A A ATATA A A ATTTTTAAGT GTATAATGTG TTAAACTACT GATTCTAATT 

175 1 GTT-GTGTAT TTTAGATTCC AACCTATGGA ACTGATGAAT GGGAGCAGTG 

IdOl GTGGAATGCC TTTAATGAGG AAAACCTGTT TTGCTCAGAA GAAATGCCAT 

1851 CTAGTGATGA TGAGGCTACT GCTGACTCTC AACATTCTAC TCCTCCAAAA 

1901 AAGAAGAGAA AGGTAGAAGA CCCCAAGGAC TTTCCTTCAG AATTGCTAAG 

1951 T777TTGAGT CATGCTGTGT TTAGTAATAG AACTCTTGCT TGCTTTGCTA 

2001 TTTACACCAC AAAGGAAAAA GCTGCACTGC TATACAAGAA AATTATGGAA 

205 1 AAATATTCTG TAACCTTTAT AAGTAGGCAT AACAGTTATA ATCATAACAT 

21U1 actgtttttt cttactccac acaggcatag agtgtctgct attaataact 

2151 ATGCTCAAAA ATTGTGTACC TTTAGCTTTT TAATTTGTAA AGGGGTTAAT 

2201 AAGGAATATT TGATGTATAG TGCCTTGACT AGAGATCATA ATCAGCCATA 

1*251 CCACATTTGT AGAGGTTTT A CTTGCTTTAA AAAACCTCCC ACACCTCCCC 

730 1 CTGAACCTGA AACATAAAAT GAATGCAATT GTTGTTGTTA ACTTGTTTAT 

235 1 TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA 

2401 ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC 

245 1 AATGTATCTT ATCATGTCTG GATCCTCTAC GCCGGACGCA TCGTGGCCGG 

1:50- CA^CACCuGC GCCACAGGTG CGGTTGCTGG CGCCTATATC GCCGACATCA 

2551 CCGATGGGGA AGATCGGGCT CGCCACTTCG GGCTCATGAG CGCTTGTTTC 

2601 GGCGTGGGTA 7 G G 7 G G C A G G C C C G 7 GG'C C G GGGGACTGTT -GGGCGCCATC 

2651 "CCTTGCATG CAlCaTTCCT TOCGGCGGCG GTGCTCaaCG GCCTCAACCT 

2701 ACTACTuGoC TGCTTCCTaa TGCAGGAGTC GCATAAGGGA gagcgtcgac 

275 1 CtiAT 0C C C 7 7 GAGAGCC77C A A C C C AGTC A GCTCC^TCCG G7GGGCGCGG 

2801 GGCA7GACTA TCGTCGCCGC AC7T ATGACT GTCT7CT7T A TCATGCAACT 

2851 CGTAGGACAG GTGCCGGCAG CGCTCTGGGT CATTTTCGGC GAGGACCGCT 

2901 T7CGCTGGAG CGCGACGA7G ATCGGCCTGT CGCT7GCGGT ATTCGGAATC 

3813 2951 ^TGCACGCCC TCGCT ^^^0 ( lJc^Q CACT GGTCCCCCCA CCAAACGTTT 

rGGCOAGAAG CAGGCCAT7A . fcSCuGGCAT GGCGGCCGAC GCGCTGGGCT 



ei I t»r* — r-^t • — — — , 



PCI'/US88/02M0 



3 8 14 



305 ; 


ACOT C T TGCT 


GCCGTTCG-.G 


ACGCGAGGCT 


GGATGGCCTT 


CCCCATTATG 


3101 


ATTCTTCTCG 


CTTCCGGCGG 


CATCGGGATG 


CCCGCGTTGC 


AGGCCATGCT 


3151 


G*CC AGGC AG 


GTAGATGACG 


ACCATCAGGG 


ACAGCTTCAA 


GGATCGCTCG 


3201 


CGGCTCTTAC 


CAGCCTAACT 


TCGATCACTG 


GACCGCTGAT 


CGTCACGGCG 


3251 


A TTT ATGCCG 


CCTCGGCGAG 


CACATGGAAC 


GGGTTGGCAT 


GGATTGTAGG 


3301 


CGCCGCCCTA 


TACCTTGTCT 


GCCTCCCCGC 


GTTGCGTCGC 


GGTGCATGGA 


3351 


GCCGGGCCAC 


CTCGACCTGA 


ATGGAAGCCG 


GCGGCACCTC 


GCTAACGGAT 


3401 


TCACCACTCC 


AAGAATTGGA 


GCCAATCAAT 


TCTTGCGGAG 


AACTGTGAAT 


:<45 1 


GCGCAAACCA 


ACCCTTGGCA 


GA A CAT ATCC 


ATCGCGTCCG 


CCATCTCCAG 


3501 


CAGCCGCACG 


CGGCGCATCT 


CGGGCCGCGT 


TGCTGGCGTT 


TTTCCATAGG 


3551 


CTCCGCCCCC 


CTGACGAGCA 


TCACAAAAAT 


CGACGCTCAA 


GTCAGAGGTG 


3601 


GCGAAACCCG 


ACAGGACTAT 


AAAGATACCA 


GGCGTTTCCC 


CCTGGAAGCT 


3651 


CCCTCGTGCG 


CTCTCCTGTT 


CCGACCCTGC 


CGCTTACCGG 


ATACCTGTCC 


3?c; 


GCCTTTCTCC 


CTTCGGGAAG 


CGTGGCGCTT 


TCTCAATGCT 


CACGCTGTAG 


375 : 


G7ATCTCAGT 


TCGGTGTAGG 


TCGTTCGCTC 


CAAGCTGGGC 


TGTGTGCACG 


3B0 1 


AACCCCCCGT 


TCAGCCCGAC 


CGCTGCGCCT 


TATCCGGTAA 


CTATCGTCTT 


3S5 1 


GAGTCCAACC 


CGGTAAGACA 


CGACTTATCG 


CCACTGGCAG 


CAGCCACTGG 


390 ; 


TA AC AGGATT 


AGCAGAGCGA 


GGTATGTAGG 


CGGTGCTACA 


GAGTTCTTGA 


J951 


ASTGGTGGCC 


TAACTACGGC 


TACACTAGAA 


GGACAGTATT 


TGGTATCTGC 


400 : 


GCTCTGCTGA 


AGCCAGTTAC 


CTTCGGAAAA 


AGAGTTGGTA 


GCTCTTGATC 


4051 


CGGC A A AC A A 


ACCACCGCTG 


GTAGCGGTGG 


TTTTTTTGTT 


TGCAAGCAGC 


4 101 


AGATTACGCG 


CAGA A AAA A A 


GGATCTCAAG 


AAGATCCTTT 


GATGTTTTCT 


•ilSl 


ACGGGGTCT.G 


ACGCTCAGTG 


GAACGAAAAC 


TCACGTTAAG 


GGATTTTGGT 


-4201 


Catgagatta 


TCAAAAAGGA 


TCTTCACCTA 


GATCCTTTTA 


AATTA AAAAT 


4251 


GAAG T "TT AA 


ATCAATCTAA 


agtatatatg 


agtaaacttg 


GTCTGACAGT 


430 1 


T A C C A A T GC T 


taatcacca 


GGC ACCTATC 


TC AGC G A T C T 


GTCTATTTCG 


4351 


TTC ATCC ATA 


GTTGCCTGAC 


tccccgtcgt 


GTAGATAACT 


ACGATACGGG 


4-ac : 


A Goo C T T A C C 


ATCTGGCCCC 


agtgctgcaa 


TGATACCGCG 


AGACCCACGC 


4451 


TCACCGGCTC 


CAGATTTATC 


agcaataaac 


CAGCCAGCCG 


GAAGGGCCGA 


4501 


GCGCAGAAGT 


GGTCCTGCAA 


CTTTATCCGC 


CTCCATCCAG 


TCTATTAATT 


4551 


G T TGCCGGGA 


AGCTAGAGTA 


AGTAGTTCGC 


cagttaatag 


TTTGCGCAAC 


460 1 


G* T G T TGCCA 




mm™ 


TCACGCTCGT 


CGTTTGGTAT 


4cS ; 


gj:tt:^ttc 


AGCTCCGGTT 


CCCAACGATC 


aaggcgagtt 


ACATGATCCC 
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Ft G. 19 (cont 'd) 



4701 


CCATGTTGTG 


CAAAAAAGCG 


4751 


AGAAGTAAGT 


TGGCCGCAGT 


4801 


TAATTCTCTT 


ACTGTCATGC 


4851 


AGTACTCAAC 


CAAGTCATTC 


4901 


TCTTGCCCGG 


CGTCAACACG 


4951 


AAAAGTGCTC 


ATCATTGGAA 


5001 


TCTTACCGCT 


GTTGAGATCC 


5051 


TGATCTTCAG 


CATCTTTTAC 


5101 


AGGAAGGCAA 


AATGCCGCAA 


5151 


GAATACTCAT 


ACTCTTCCTT 


5201 


TATTGTCTCA 


TGAGCGGATA 


5251 


AATAGGGGTT 


CCGCGCACAT 


5301 


A A AC C ATT AT 


TATCATGACA 


535 1 


CCCTTTCGTC 


TTCAA 



GTTAGCTCCT TCGGTCCTCC GATCGTTGTC 
GTTATCACTC ATGGTTATGG CAGCACTGCA 
CATCCGTAAG ATGCTTTTCT GTGACTGGTG 
TGAGAATAGT GTATGCGGCG ACCGAGTTGC 
GGATAATACC GCGCCACATA GCAGAACTTT 
AACGTTCTTC GGGGCGAAAA CTCTCAAGGA 
AGTTCGATGT AACCCACTCG TGCACCCAAC 
TTTCACCAGC GTTTCTGGGT GAGCAAAAAC 
AAAAGGGAAT AAGGGCGACA CGGAAATGTT 
TTTCAATATT ATTGAAGCAT TTATCAGGGT 
CATATTTGAA TGTATTTAGA AAAATAAACA 
TTCCCCGAAA AGTGCCACCT GACGTCTAAG 
TTAACCTATA AAAATAGGCG TATCACGAGG 
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Dg396.sea Lengtn: 5518 

1 CAATTAATTC CAGCTTGCTG TGGAATGTGT GTCAGTTAGG GTGTGGAAAG 

51 TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 

101 GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GCCAGAAGTA 

151 TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT 

201 CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT CTCCGCCCCA 

251 TGGCTGACTA ATTTTTTTTA TTTATGCAGA GGCCGAGGCC GCCTCGGCCT 

301 CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG GGTCCTCCTC 

351 GTATAGAAAC TCGGACCACT CTGAGACGAA GGCTCGCGTC CAGGCCAGCA 

401 CGAAGGAGGC TAAGTGGGAG GGGTAGCGGT cgttgtccac TAGGGGGTCC 

451 ACTCGCTCCA GGGTGTGAAG ACACATGTCG CCCTCTTCGG CATCAAGGAA 

501 GGTGATTGGT TTATaGGTGT AGGCCACGTG ACCGGGTGTT CCTGAAGGGG 

551 GGCTATAAAA GGGGGTGGGG GCGCGTTCGT CCTCACTCTC TTCCGCATCG 

601 CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC GGTTGAGGAC AAACTCTTCG 

651 CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 

701 CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 

751 CTCTCGAGAA AGGCGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC 

B01 CGTGGCGGGC GGCAGCGGGT GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC 

851 TGCTGATGAT GTAATTAAAG TAGGCGGTCT TGAGACGGCG GATGGTCGAG 

901 GTGAGGTGTG GCAGGCTTGA GATCGATCTG GCCATACACT TGAGTGACAA 

951 TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGGTCCAAC 
1001 - TGGATCCAAG C-TTCGACT-CG -A GGA A TTC C C C G A A GG A A C A AAGCACCCTC 

1051 CCCACTGGGC TCCTGGTTGC AGAGCTCCAA GTCCTCACAC AGATACGCCT 

1101 GTTTGAGAAG CAGCGGGCA A GAAAGACGCA AGCCCAGAGG CCCTGCCATT 

11S1 TCTGTGGGCT CAGGTCCCTA CTGGCTCAGG CCCCTGCCTC CCTCGGC A AG 

1201 GCCACAATGA ACCGGGGAGT CCCTTTTAGG CACTTGCTTC TGGTGCTGCA 

1251 ACTGGCGCTC CTCCCAGCAG CCACTCAGGG AAAGA A AGTG GTGCTGGGCA 

1301 A A AAAGGGGA TACAGTGGAA CTGACCTGTA CAGCTTCCCA GAAGAAGAGC 

135 1 ATACAATTCC ACTGGA ^ < g r ^^ k ^ C ^ AG ATAAAGATTC TGGGAAATCA 

]4T: GCi^CTCCTTC TTAACTAAAG GTCCATCCAA GCTGAATGAT CGCGCTGACT 



6,/9i FIG. 20 (cont'd) 

1451 CAAGAAGAAG CTTGTGGGAC CAAGGAAACT TTCCCCTGAT CATCAAGAAT 

X501 CTTAAGATAG AAGACTCAGA T ACTTACATC TGTGAAGTGG AGGACCAGAA 

X551 GGAGGAGGTG CAATTGCTAG TGTTCGGATT GACTGCCAAC TCTGACACCC 

1601 ACCTGCTTCA GGGGCAGAGC CTGACCCTGA CCTTGGAGAG CCCCCCTGGT 

1651 AGTAGCCCCT CAGTGCAATG taGGAGTCCa AGGGGTAAAA ACATACAGGG 

1701 GGGGAAGACC CTC TCCGTGT CTCAGCTGGA GCTCCAGGAT AGTGGCACCT 

1751 GGACATGCAC TGTCTTGCAG AACTGAGATC TTTGTGAAGG AACCTTACTT 

1801 CTGTGGTGTG ACATAATTGG ACAAACTACC TaCAGAGATT TAAAGCTCTA 

1851 AGGTAAATAT AAAA TTTTTA AGTGTATAAT GTG T TAAACT ACTGATTCTA 

190 1 ATTGTTTGTG TATJTTAGAT TCCAACCTAT GGAACTGATG AATGGGAGCA 

i951 GTGGTGGAAT GCCT TTAATG AGGAAAACCT GTTTTGCTCA GAAGAAATGC 

2C0 1 catctagtga tgatgaggct actgctgact ctcaacattc tactcctcca 

205 1 AAAAAGAAGA gaaaggtaga agaccccaag gactttcctt cagaattgct 

2101 aagttttttg agtcatgctg tgtttagtaa tagaactctt gcttgctttg 

2151 ctatttacac cacaaaggaa aaagctgcac tgctatacaa gaaaattatg 

2201 GAAAAA7ATT CTGTAACCTT TATAAGTAGG CATAACAGTT ATAATCATAA 

2251 CATACTGTTT TTTCTTACTC CACACAGGCA TAGAGTGTCT GCTATTAATA 

2301 ACTATGCTCA AAAATTGTGT ACCTTTAGCT TTTTAATTTG TAAAGGGGTT 

2351 AATAAGGAAT ATTTGATGTA TAGTGCCTTG ACTAGAGATC ATAATCAGCC 

2401 ATACCACATT TGTAGAGGTT TTACTTGCTT TAAAAAACCT CCCACACCTC 

2451 CCCCTGAACC TGAAACATAA AATGAATGCA ATTGTTGTTG TTAACTTGTT 

2501 TATTGCAGCT TATAATGGTT ACAAATAAAG CAATAGCATC ACAAATTTCA 

2551 CAAATAAAGC ATTTTTTTC A CTGCATTCTA GTTGTGGTTT GTCCAAACTC 

2601 ATC A ATGT AT CTT ATCATGT CTGGATCCTC TACGCCGGAC GCATCGTGGC 

2651 CGGCATCACC GGCGCCACAG GTGCGGTTGC TGGCGCCTAT ATCGCCGACA 

270 1 TCACCGATGG GGA AGATCGG GCTCGCCACT TCGGGCTCAT GAGCGCTTGT 

275 1- TTCGGCGTGG GTATGGTGGC AGGCCCGTGG CCGGGGGACT GTTGGGCGCC 

2801 ATC"CCTTGC ATGCACCATT CCTTGCGGCG GCGGTGCTC A ACGGCCTCA A 

2851 cctactactg ggctgcttcc taatgcagga gtcgcataag ggagagcgtc 

290 1 gaccgatgcc cttgagagcc ttcaacccag tcagctcctt ccggtgggcg 

295 1 cggggcatga ctatcgtcgc cgcacttatg actgtcttct ttatcatgca 

3 8 17 3001 ac'cgtagga caggt 3 ( 5pq c g5 >l jj^ tctg ggtcattttc ggcgaggacc 

305) c:ttt:::tg gagcgcgacg atgatcggcc tqtcgcttgc ggtattcgga 
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FIG. 20 (cont'd) 



3L01 
3151 
3201 
3251 
3301 
3351 
3401 
3453 
3501 
3551 
3601 
3651 
3701 
3751 
3B01 
3851 
3901 
3951 
4001 
4051 
4101 
4151 
4201 
4251 
4301 
4351 
4401 
445 : 
4501 
4551 
4601 
4651 
-?r. 1 



ATCTTGCACG CCCTCGCTCA AGCCTTCGTC ACTGGTCCCG CCACCAAACG 
TTTCGGCGAG AAGCAGGCCA TTATCGCCGG C^TGGCGGCC GACGCGCTGG 
GCTACGTCTT GCTGGCGTTC GCGACGCGAG GCTGGATGGC CTTCCCCATT 
ATGATTCTTC TCGCTTCCGG CGGCATCGGG ATGCCCGCGT TGCAGGCCAT 
GCTGTCCAGG CAGGTAGATG A CGACC ATC A GGGAC AGC T T C A AGGATCGC 
TCGCGGCTCT TACC AGCCTA ACTTCGATCA CTGGACCGCT GATCGTCACG 
GCGATTTATG CCGCCTCGGC GAGCACATGG AACGGGTTGG CaTGGATTGT 
AGGCGCCGCC CTATACCTTG TCTGCCTCCC CGCGTTGCGT CGCGGTGCAT 
GGAGCCGGGC CACCTCGACC TGAATGGAAG CCGGCGGCAC CTCGCTAACG 
GATTCACCAC TCCAAGA ATT GGAGCCAA^C AATTCTTGCG GAGAACTGTG 
AATGCGCAAA CCAACCCTTG GCAGAACATA TCCATCGCGT CCGCCATCTC 
CAGCAGCCGC ACGCGGCGCA TCTCGGGCCG CGTTGCTGGC GTTTTTCCAT 
AGGCTCCGCC CCCCTGACGA GCATCACAAA AATCGACGCT CAAGTCAGAG 
GTGGCGAAAC CCGACAGGAC TATAAAGATA CCAGGCGTTT CCCCCTGGAA 
GCTCCCTCGT GCGCTCTCCT GTTCCGACCC TGCCGCTTAC CGGATACCTG 
TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCAAT GCTCACGCTG 
taGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC 
ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT 
CTTGAGTCCA ACCCGGTAAG ACACGACTTA TCGCCACTGG CAGCAGCCAC 
TGGTAACAGG ATTAGCAGAG CGAGGTATGT AGGCGGTGCT ACAGAGTTCT 
TGAAGTGGTG GCCTAACTAC GGCTACACTA GAAGGACAGT ATTTGGTATC 
TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG GTAGCTCTTG 
•ATCCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC 
AGCAGATTAC GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT 
TCT ACGGGGT CTGACGCTCA GTGGAACGAA AACTCACGTT AAGGGATTTT 
GGTCATGAGA TTATCAAAAA GGATCTTCAC CTAGATCCTT TTA A ATTAAA 
AATGAAGTTT TAAATCAATC TAAAGTATAT ATGAGTAAAC TTGGTCTGAC 
AGT-ACCAA7 GCTTAATCAG TGAGGCACCT ATCTCAGCGA TCTGTCTATT 
TCGTTCATCC ATAGTTGCCT GACTCCCCGT CGTGTAGATA ACTACGATAC 
GGGAGGGCTT ACCATCTGGC CCCAGTGCTG CAATGATACC GCGAGACCCA 
CGCTCACCGG CTCCAGATTT ATCAGCAATA AACCAGCCAG CCGGAAGGGC 
CGAGCGCAGA AGTGGTrCTG-rAACTTTATC CGCCTCCATC CAGTCTATTA 



AT^OTTGCCG GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA TAGTTTGCGC 
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FIG. 20 (cont'd) 



4751 


AACGTTGTTG 


CCATTGCTGC 


AGGCATCGTG 


GTGTCACGCT 


CGTCGTTTGG 


480 1 


TATGGCTTCA 


TTCAGCTCCG 


GTTCCCAACG 


ATCAAGGCGA 


GTTACATGAT 


4851 


CCCCCATGTT 


GTGCAAAAAA 


GCGGTTAGCT 


CCTTCGGTCC 


TCCGATCGTT 


4901 


GTCAGAAGTA 


AGTTGGCCGC 


AGTGTTATCA 


CTCATGGTTA 


TGGCAGCACT 


4951 


GCATAATTCT 


CTTACTGTCA 


TGCCATCCGT 


AAGATGCTTT 


TCTGTGACTG 


5001 


GTGAGTACTC 


AACCAAGTCA 


TTCTGAGAAT 


AGTGTATGCG 


GCGACCGAGT 


5051 


TGCTCTTGCC 


CGGCGTCAAC 


ACGGGATAAT 


ACCGCGCCAC 


ATAGCAGAAC 


5101 


7TTAAAAGTG 


CTCATCATTG 


GAAAACGTTC 


TTCGGGGCGA 


AAACTCTCAA 


5151 


GGATCTTACC 


GCTGTTGAGA 


TCCAGTTCGA 


TGTAACCCAC 


TCGTGCACCC 


520 1 


AACTGATCTT 


CAGCATCT-^T 


TACTTTCACC 


AGCGTTTCTG 


GGTGAGCAAA 


5 251 


AACAGGAAGG 


CAAAATGCCG 


CAAAAAAGGG 


AATAAGGGCG 


ACACGGAAAT 


5301 


GTTGA A T ACT 


CATACTCTTC 


CTTTTTCAAT 


ATTATTGAAG 


CATTTATCAG 


5351 


GGTf ATTGTC 


TCATGAGCGG 


ATACATATTT 


GAATGTATTT 


AGAAAAATAA 


5401 


ACA A ATAGGG 


GTTCCGCGCA 


CATTTCCCCG 


AAAAGTGCCA 


CCTGACGTCT 


5451 


AAGAAACCAT 


TATTATCATG 


ACATTAACCT 


ATAAAAATAG 


GCGTATCACG 


5501 


AGGCCCTTTC 


GTCTTCAA 
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PBG393 

:BG368 backbone 
:SOluDl T4#8 
:AA #3 = LVS 

: M Pirftct" Stu/f irst 182 AA of 
:bas1cal1y up to V2J2 

bg393.seq Length: 5566 



1 


GAATTAATTC 


CAGCTTGCTG 


51 


TCCCCAGGCT 


CCCCAGCAGG 


101 


GTCAGCAACC 


AGGTGTGGAA 


151 


TGCAAAGCAT 


GCATCTCAAT 


201 


CCGCCCATCC 


CGCCCCTAAC 


251 


TGGCTGACTA 


ATTTTTTTTA 


301 


CTGAGCTATT 


CCAGAAGTAG 


351 


GTATAGAAAC 


TCGGACCACT 


401 


CGAAGGAGGC 


TAAGTGGGAG 


451 


ACTCGCTCCA 


GGGTGTGAAG 


501 


GGTGATTGGT 


TTATAGGTGT 


551 


GGCTATAAAA 


GGGGGTGGGG 


601 


CTGTCTGCGA 


GGGCCAGCTG 


65 1 


CGGTCTTTCC 


AGTACTCTTG 


701 


CTCCGCCACC 


GAGGGACCTG 


751 


CTCTCGAGAA 


AGGCGTCTAA 


801 


CGTGGCGGGC 


GGCAGCGGGT 


851 


- TGCTGATGAT 


GTA ATT A A AG 


901 


GTGAGGTGTG 


GCAGGCTTGA 


951- 


TGACATCCAC 


TTTGCCTTTC 


1001 


TGGATCCAAG 


CTTCGACTCG 


105 1 


CCCACTGGGC 


TCCTGGTTGC 


1101 


GTTTGAGAAG 


CAGCGGGCAA 


1151 


TCTGTGGGCT 


CAGGTCCCTA 


1201 


GCCACAATGA 


ACCGGGGAGT 


1251 


ACTGGCGCTC 


CTCCCAGCAG 


1301 
135 1 


AAAAAGGGGA 
ATACAATTCC 


TACA0T Si9tj 

AC*GGAAAAA 



FIG. 21 

T4 



TGGAATGTGT 


GTCAGTTAGG 


GTGTGGAAAG 


CAGAAGTATG 


CAAAGCATGC 


ATCTCAATTA 


AGTCCCCAGG 


CTCCCCAGCA 


GGCAGAAGTA 


TAGTCAGCAA 


CCATAGTCCC 


GCCCCTAACT 


TCCGCCCAGT 


TCCGCCCATT 


CTCCGCCCCA 


TTTATGCAGA 


GGCCGAGGCC 


GCCTCGGCCT 


TGAGGAGGCT 


TTTTTGGAGG 


GGTCCTCCTC 


CTGAGACGAA 


GGCTCGCGTC 


CAGGCCAGCA 


GGGTAGCGGT 


CGTTGTCCAC 


TAGGGGGTCC 


ACACATGTCG 


CCCTCTTCGG 


CATCAAGGAA 


AGGCCACGTG 


ACCGGGTGTT 


CCTGAAGGGG 


GCGCGTTCGT 


CCTCACTCTC 


TTCCGCATCG 


TTGGGCTCGC 


GGTTGAGGAC 


AAACTCTTCG 


GATCGGAAAC 


CCGTCGGCCT 


CCGAACGGTA 


AGCGAGTCCG 


CATCGACCGG 


ATCGGAAAAC 


CCAGTCACAG 


TCGCAAGGTA 


GGCTGAGCAC 


GGCGGTCGGG 


GTTGTTTCTG 


GCGGAGGTGC 


TAGGCGGTCT 


TGAGACGGCG 


GATGGTCGAG 


GATCGATCTG 


GCCATACACT 


TGAGTGACAA 


TCTCCACAGG 


TGTCCACTCC 


CAGGTCCAAC 


AGGAATTCCC 


CGAAGGA AC A 


AAGCACCCTC 


AGAGCTCCAA 


GTCCTCACAC 


AGATACGCCT 


GAAAGACGCA 


AGCCCAGAGG 


CCCTGCCATT 


CTGGCTCAGG 


CCCCTGCCTC 


CCTCGGCAAG 


CCCTTTTAGG 


CACTTGCTTC 


TGGTGCTGCA 


CCACTCAGGG 


AAAGAAAGTG 


GTGCTGGGCA 


CTCCAACCAG 


CAGCTTCCCA 


GAAGAAGAGC 


ATAAAGATTC 


TGGGAAATCA 
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210: 
215: 
220: 

225: 
230: 
235: 
240: 

245: 

250 : 
255: 

260-: 

265: 
270: 
275: 
280: 
285: 
290: 
295 : 
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" V9i FIG. 2 1 (cont'd) 

CGGCTCCTTC TTAACTAAAG GTCCATCCAA GCTGA ATGAT CGCGCTGACT 
CAAGAAGAAG CTTGTGGGAC CAAGGAAAJT TTCCCCTGAT CATCAAGAAT 
CTTAAGATAG AAGACTCAGA TACTTACATC TGTGAAGTGG AGGACCAGAA 
GGAGGAGGTG CAATTGCTAG TGTTCGGATT GACTGCCAAC TCTGACACCC 
ACCTGCTTCA GGGGCAGAGC CTGACCCTGA CCTTGGAGAG CCCCCCTGGT 
AGTAGCCCCT CAGTGCAATG TAGGAGTCCA AGGGGTAAAA ACATACAGGG 
GGGGAAGACC CTCTCCGTGT CTCAGCTGGA GCTCCAGGAT AGTGGCACCT 
GGACATGCAC TGTCTTGCAG AACCAGAAGA AGGTGGAGTT CAAAATAGAC 
ATCGTGGTGC TAGCTTTCCA GTGAGATCTT TGTGAAGGAA CCTTACTTCT 
GTGGTGTGAC ATAATTGGAC A A ACT AC CT A CAGAGATTTA AAGCTCTAAG 
GTAAATATAA AATTTTTAAG TGTATAATGT GTTAAACTAC TGATTCTAAT 
TGTTTGTGTA TTTTAGATTC CAACCTATGG AACTGATGAA TGGGAGCAGT 
GGTGGAATGC CTTTA ATGAG GAAAACCTGT TTTGCTCAGA AGAAATGCCA 
TCTAGTGATG ATGAGGCTAC TGCTGACTCT CAACATTCTA CTCCTCCAAA 
AAAGAAGAGA AAGGTAGAAG ACCCCAAGGA CTTTCCTTCA GAATTGCTAA 
GTTTTTTGAG TCATGCTGTG TTTAGTAATA GAACTCTTGC TTGCTTTGCT 
ATTTACACCA CAAAGGAAAA AGCTGCACTG CTATACAAGA AAATTATGGA 
AAAATATTCT GTA ACCTTT A TAAGTAGGCA TAACAGTTAT AATCATAACA 
TACTGTTTTT TCTT ACTCCA CACAGGCATA GAGTGTCTGC TATTAATAAC 
TATGCTCAAA AATTGTGTAC CTTTA GCTTT TTAATTTGTA AAGGGGTTAA 
TAAGGAATAT TTGATGTATA GTGCCTTGAC TAGAGATCAT AATCAGCCAT 
ACCACATTTG TAGAGGTTTT ACTTGCTTTA AAA AACCTCC CACACCTCCC 
CC-TGAACCTG AAACATAAAA TGAATGCAAT tgttgttgtt aacttgttta 
TTGCAGCTTA TAATGGTTAC AAATAAAGCA ATAGCATCAC AAATTTCACA 
— A A T A A A GC A T__ T-T-T-T_T_T C A C-T --G C AT-T-CT A GT— T-GTGG-TT-T-GT~C C A A A C TC A T- 
CAA'GTATCT TATCATGTCT GGATCCTCTA CGCCGGACGC ATCGTGGCCG 
GCATCACCGG CGCCACAGGT GCGGTTGCTG GCGCC T AT AT CGCCGACATC 
ACCGATGGGG AAG*"GGGC TCGCCACTTC GGGCTCATGA GCGCTTGTTT 
CGGCGTGGGT ATGGTGGCAG GCCCGTGGCC GGGGGACTGT TGGGCGCCAT 
CTCCTTGCAT GCACCATTCC TTGCGGCGGC GGTGCTC A AC GGCCTCAACC 
TACTACTGGG ctgcttccta ATGCAGGAGT CGCATAAGGG AGAGCGTCGA 
CCGATGCCCT TGAGA ^^^g^^ C J[ < t^ GTC A GCTCCTTCC GGTGGGCGCG 
GGGC* T GACT ATCG^CGCCG C A CTT ATGAC TGTCTTCTTT ATCATGCAAC 
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" /93 FIG. 21 (cont 'd) 

3051 TCGTAGGACA GGTGCCGGCA GCGCTCTGGG TCATTTTCGG CGAGGACCGC 

3101 TTTCGCTGGA GCCCGACGAT GATCGGCC" G TCGCTTGCGG TATTCGGAAT 

3151 CTTGCACGCC CTCGCTCAAG CCTTCGTCAC TGGTCCCGCC ACCAAACGTT 

3201 TCGGCGAGAA GCAGGCCATT ATCGCCGGCA TGGCGGCCGA CGCGCTGGGC 

3251 TACGTCTTGC TGGCGTTCGC GACGCGAGGC TGGATGGCCT TCCCCATTAT 

3301 GATTCTTCTC GCTTCCGGCG GCATCGGGAT GCCCGCGTTG CAGGCCATGC 

3351 TGTCCAGGCA GGTAGATGAC GACCATCAGG GACAGCTTCA AGGATCGCTC 

3401 GCGGCTCTTA CCAGCCTAAC TTCGATCACT GGACCGCTGA TCGTCACGGC 

3451 GATTTATGCC GCCTCGGCGA GCACATGGAA CGGGTTGGCA TGGATTGTAG 

3501 GCGCCGCCCT ATACCTTGTC TGCCTCCCCG CGTTGCGTCG CGGTGCATGG 

3551 AGCCGGGCCA CCTCGACCTG AATGGAAGCC GGCGGCACCT CGCTAACGGA 

3601 TTCACCACTC CAAGAATTGG AGCCAATCAA TTCTTGCGGA GAACTGTGAA 

3651 TGCGCAAACC AACCCTTGGC AGAACATATC CATCGCGTCC GCCATCTCCA 

3 70: GCAGCCGCAC GCGGCGCATC TCGGGCCGCG TTGCTGGCGT TTTTCCATAG 
3751 GCTCCGCCCC CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT 
3801 GGCGAAACCC GACAGGACTA TAAAGATACC AGGCGTTTCC CCCTGGAAGC 
3851 TCCCTCGTGC GCTCTCCTGT TCCGACCCTG CCGCTTACCG GATACCTGTC 
3901 CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT TTCTCAATGC TCACGCTGTA 

3951 GGTATCTCAG ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac 

4001 GAACCCCCCG TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT 

4051 TGAGTCCAAC CCGGTAAGAC ACGACTTATC GCCACTGGCA GCAGCCACTG 

4 101 GTAACAGGAT TAGCAGAGCG AGGTATGTAG GCGGTGCTAC AGAGTTCTTG 
4 151 AAGTGGTGGC CTAACTACGG CTACACTAGA AGGACAGTAT TTGGTATCTG 
4201 CGCTCTGCTG AAGCCAGTTA CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT 
4251- CCGGCAAAGA AACCACCGCT- GGTAGCGGTG" GTTTTTTTGT TTGCAAGCAG 
4301 CAGATTACGC GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC 
435 1 TACGGGGTCT GACGCTCAGT GGA ACGAAA A CTCACGTTA A GGGATTTTGG 
4401 TCATGAGATT ATCAAAAAGG ATCTTCACCT AGATCCTTTT AAATTAAAAA 
4451 TGAAGTTTTA AATCAATCTA AAGTATATAT GAGTAAACTT GGTCTGACAG 
4501 TTACCAATGC TTAATCAGTG AGGCACCTAT CTCAGCGATC TGTCTATTTC 
4551 GTTCATCCAT AGTJGCCTGA CTCCCCGTCG TGTAGATAAC TACGATACGG 
4601 GAGGGCTTAC CATCTGC ^^ C Q §^§JJ GCA ATGATACCG C GAGACCCACG 

*es: c^caccggct CCAGATTTAT CAGCAATaaa ccagccagcc ggaagggccg 



WO 89/01940 




PCT/US88/02940 



67/93 



FIG. 21 (cont'd) 



4701 


AGCGCAGAAG 


TGGTCCTGCA 


ACTTTATCCG 


CCTCCATCCA 


GTCTATTAAT 


4751 


TGTTGCCGGG 


AAGCTAGAGT 


AAGTAGTTCG 


CCAGTTAATA 


GTTTGCGCAA 


4801 


CGTTGTTGCC 


ATTGCTGCAG 


GCATCGTGGT 


GTCACGCTCG 


TCGTTTGGTA 


4851 


TGGCTTCATT 


CAGCTCCGGT 


TCCCAACGAT 


CAAGGCGAGT 


TACATGATCC 


4901 


CCCATGTTGT 


GCAAAAAAGC 


GGTTAGCTCC 


TTCGGTCCTC 


CGATCGTTGT 


4951 


CAGAAGTAAG 


TTGGCCGCAG 


TGTTATCACT 


CATGGTTATG 


GCAGCACTGC 


5001 


ATAATTCTCT 


TACTGTCATG 


CCATCCGTAA 


GATGCTTTTC 


TGTGACTGGT 


5051 


GAGTACTCAA 


CCAAGTCATT 


CTGAGAATAG 


TGTATGCGGC 


GACCGAGTTG 


5101 


CTCTTGCCCG 


GCGTCAACAC 


GGGATAATAC 


CGCGCCACAT 


AGCAGAACTT 


5151 


T AAA AGTGCT 


CATCATTGGA 


AAACGTTCTT 


CGGGGCGAAA 


ACTCTCAAGG 


5201 


ATCTTACCGC 


TGTTGAGATC 


CAGTTCGATG 


TAACCCACTf 


GTGCACCCAA 


5251 


CTGATCTTCA 


GCATCTTTTA 


CTTTCACCAG 


CGTTTCTGGG 


TGAGCAAAAA 


5301 


CAGGAAGGCA 


AAATGCCGCA 


AAAAAGGGAA 


TAAGGGCGAC 


ACGGAAATGT 


5351 


TGAATACTCA 


TACTCTTCCT 


TTTTCAATAT 


TATTGAAGCA 


TTTATCAGGG 


5401 


TTATTGTCTC 


ATGAGCGGAT 


ACATATTTGA 


ATGTATTTAG 


A A A A AT A A AC 


5451 


AAAT AGGGGT 


TCCGCGCACA 


TTTCCCCGAA 


AAGTGCCACC 


TGACGTCTAA 


5501 


GAAACCATTA 


TTATCATGAC 


ATTAACCTAT 


AAAAAT AGGC 


GTATCACGAG 


5551 


GCCCTTTCGT 


CTTCAA 
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pa(S 395 

80368 DaCMCjOr.tr 

so I uD i e T4# 10 

A A #3 = LVS 

first 131 A A of T4 



F/fi 22 



3824 



og395.sea Length. 5413 

: gaattaattc cagcttgctg tggaatgtgt gtcagttagg 
£1 tccccaggct ccccagcagg cagaagtatg caaagcatgc 

0 1 GtCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA 

TGCAAAGCAT GCATCTCAA^ TAGTCAGCAA CCATAGTCCC 

CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT 

TGGCTGACTA ATTTTTTTTA TTTATGCAGA GGCCGAGGCC 

CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG 

GTataoaaaC tcggaccact ctgagacgaa ggctcgcgtc 
cgaaggaggc taagtgggag gggtagcggt cgttgtccac 

ACTCGCTCCA GGGTGTGAAG ACACATGTCG CCCTCTTCGG 
GG-GATTGGT TTATAGGTGT AGGCCACGTG ACCGGGTGTT 
GGCTATAAAA GGGGGTGGGG GCGCGTTCGT CCTCACTCTC 
CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC GGTTGAGGAC 
CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT 
CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG 
C'CTCGAGAA AGGCGTCTAA CCAGTCACAG TCGCAAGGTA 
CGTGGCGGGC GGCAGCGGGT GGCGGTCGGG GTTGTTTCTG 
TGCTGATGAT GTaaTTaaaG TAGGCGGTCT TGAGACGGCG 
GTGAGGTG^G GC AGGC TTGA GATCGATCTG GCCATACACT 
TuAC-A'CCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC 
TGCATCCAAG CTTCGACTCG AGGAATTCCC CGAAGGAACA 

1051 C C C a C T GGGC— T-C G T-GGTTGC -AGAGGTC C A A~ GTC CT C A CA C _ 

110 1 GT^TGAGAAG CAGCGGGC A A GAAAGACGCA AGCCCaGAGG 
:i5 1 TCTGTGGGCT CAGGTCCCTa CTGGCTCAGG CCCCTGCCTC 
WOl GCCACAATGA ACCGGGGAGT CCCTTTTAGG CACTTGCTTC 
1251 ACTGGCGCTC CTCCCAGCAG CCACTCAGGG AAAGAAAGTG 
1301 a a a a AGGGGA TACAGTGGAA CTGACCTGTA CAGCTTCCCA 
1351 A T A C A A TT Z C AC^GGAAAAA CTCCAACCAG ATAAAGATTC 
HCi GCOCTCCTTC a a CT A g A ^Q G g ^ A GCTGAATGAT 



151 
?0 J 
25 1 
3C : 
1 

4c: 

451 

«:oi 

551 
601 
o5l 

?0l 

75: 

901 

es\ 

901 
951 
100 1 



GTGTGGAAAG 

ATCTCAATTA 

GGCAGAAGTA 

GCCCCTAACT 

CTCCGCCCCA 

GCCTCGGCCT 

GGTCCTCCTC 

CAGGCCAGCA 

TAGGGGGTCC 

CATCAAGGAA 

CCTGAAGGGG 

TTCCGCATCG 

AAACTCTTCG 

CCGAACGGTA 

ATCGGAAAAC 

GGCTGAGCAC 

GCGGAGGTGC 

GATGGTCGAG 

TGAGTGACAA 

CAGGTCC^-Z 

AAGCACCCTC 

AGATACGCCT 

CCCTGCCATT 

CCTCGGCAAG 

TGGTGCTGCA 

GTGCTGGGCA 

GAAGAAGAGC 

TGGGAAATCA 

CGCGCTGACT 



FIG. 22(cont'd) 



3 



W Jt C 1 

143 1 


CAAGAAGAAG 


CTTGTGGGmC 


Caaggaaac^ 


TTCCCCTGAT 


CATCAAGAAT 


X.50 1 


CTTAAGATAG 


A A 0* A ^* ^P A A 

A AGA C TCAGA 


» — A Ate AM A A A» — « 

tacttacatc 


TG7GAAGTGG 


AGGACCAGAA 


t e c i 
155 1 


□uAboAGGTG 


^» A ».^*0'0 m 0*TA0' 

CAATTGCTAG 


TGTTCGGATT 


GACTGCCAAC 


TCTGACACCC 


lbUl 


ALL T bCTTCA 


0*0*0*0*0**0*1^0*0* 

GGGGCAGAGC 


0* ^LW J* A 0-% kW*% Ate A% _ 

CTGACCCTGA 


CCTTGGAGAG 


CCCCCCGGGT 


165 1 


A J - ~ » A f ^ <~ ^ 0* ~ * 

AGT AGCCCCT 


_p» A A* A^ ^* A a A A 

CAGTGCAATG 


AGATCTTTGT 


GAAGGAACCT 


TACTTCTGTG 


170 1 


GTGTGACATA 


A AM A A a. Jpfe AAA 

ATTGGACAAA 


CTACCTACAG 


AGATTTAAAG 


CTCTAAGGTA 


1751 


AATATAAAAT 


TTTTAAGTGT 


ATAATGTGTT 


AAACTACTGA 


TTCTAATTGT 


180 1 


TTGTGTATTT 


TAGATTCCAA 


CCTATGGAAC 


TGATGAATGG 


GAGCAGTGGT 


135 1 


GGAATGCCTT 


TAATGAGGAA 


AACCTGTTTT 


GCTCAGAAGA 


AATGCCATCT 


1901 


A ^ T * l~ Ik T 0* A 

AGTGATGATG 


AGGCTACTGC 


tgactctcaa 


CATTCTACTC 


CTCCAAAAAA 


195 1 


A A ^ A ^* A A A /~ 


GTAGAAGACl 


CCAAGGACTT 


TCCTTCAGAA 


TTGCTAAGTT 


200 1 




TP 0*~Y0*^0**T*^~T 

TGCTGTG • TT 


AGTAATAGAA 


CTCTTGCTTG 


C T TTGCTATT 


one i 


~ "Ai"~A~/"*A^*A A 

TALALCALAA 


A i** aV m A a A J - • ^ 

AGuAAAAAGC 


TGCACTGCTA 


TACA AGA AAA 


TTATGGAAAA 


. Z 1 0 1 


ATA T TCTGTA 


ACCTTTATAA 


GTAGGCATAA 


CAGTTATAAT 


C ATA A CAT AC 


/ 13 1 


I L» I < • 1 1 1 L 1 


TALTCCACAC 


AGGCATAGAG 


TGTCTGCTAT 


TAATAACTAT 




^ T f* A A A A A T 

IjL 1 LAAAAAT 


TG T G T ACCTT 


A 0* «»» Ate «te Ate - 

TAGCTTTTTA 


ATTTGTAAAG 


GGGTTAATAA 




~ #*- A a X A TT" - /~ 

UuAA 1 A 1 1 T(j 


a Y ^P A A 0* 0* 

ATGTATAGTG 


CCTTGACTAG 


AGATCATAAT 


CAGCCATACC 


-Jill 4 


A i*~ A T T * T ' ^ T A/™ 

ALA 1 ( 1 b 1 ALt 


AGGTTTT ACT 


TGCTTTAA A A 


AACCTCCCAC 


ACCTCCCCCT 


Tit * 


r a a rrTr a a a 
uAALl 1 uAAA 


0* A ^P A A A A ^P 0* A 

CATAAAATGA 


ATGCAATTGT 


TGTTGTTAAC 


TTGTTTATTG 


-t **U 1 


CagCTTataa 


TGGTTACAA A 


TAAAGCAATA 


GCATCACAAA 


TTTCACAAAT 


■J *c < 

1 


aaaglatttt 


^P ^» V ^ A ^* ^P ^ - 

TTTC ACTGCA 


TTCTAGTTGT 


GGTTTGTCCA 


AACTCATCAA 


'i c n i 

£3U 1 


T/~TaT^T^at 
i O 1 A 1 L 1 . A T 


LATGT CTGGA 


TCCTCTACGC 
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